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Abstract. Let M be a closed, orientable hyperbolic S-manifold such that (1) Hi{M ; Z.,) = 
for s = 2, 3, 7. Suppose that (2) M is non-Haken, or more generally that it has integral 
traces. If the trace field of M is quadratic then 0.395 is a Margulis number for M . If the 
trace field is cubic then 0.3 is a Margulis number for M. 

O 

If K is any number field, then for all but finitely many closed, orientable hyperbolic 
CS| 3-manifolds M which satisfy (1) and (2) and have trace field K, the number 0.183 is a 

Margulis number for M . Furthermore, if K is any number field, there is a real number e 
O with < e < 0.3, having the following property. Let M be any closed hyperbolic 3-manifold 

which satisfies (1) and (2) and has trace field K. Then about every primitive closed geodesic 
in M having length I < e there is an embedded tube having radius R{1), where R{1) is an 
explicitly defined function such that sinh^ R{1) is asymptotic to (. 01869. 

o 

S 



1. Introduction 



Let M be a closed hyperbolic n-manifold. We may write M = El^/r where T < Isom+(]HI") is 
discrete, cocompact and torsion-free. The group T is uniquely determined by M up to conjugacy 
in Isom_|_(]H"). We define a Margulis number for M, or for T, to be a positive real number with 
the following property: 

^ 1.0.1. If P is a point of H", and if xi and X2 are elements of T such that d{P,Xi ■ P) < fi for 

i = 1,2, then x\ and X2 commute. 

(N 

Here d denotes the hyperbolic distance on H"'. 

The Margulis Lemma [7, Chapter D] implies that for every n > 2 there is a positive constant which 
^ is a Margulis number for every closed hyperbolic n-manifold. The largest such number, n{n), is 

*^ called the Margulis constant for closed hyperbolic n-manifolds. 

^ Margulis numbers play a central role in the geometry of hyperbolic manifolds. If u is a Marguhs 

number for M then the points of M where the injectivity radius is less than fi/2 form a disjoint 
union of "tubes" about closed geodesies whose geometric structure can be precisely described. 
Topologically they are open (n — l)-ball bundles over S^. This observation and the Margulis 
Lemma can be used to show, for example, that for every V > there is a finite collection of 
compact orientable 3-manifolds Mi, . . . , Mjv, whose boundary components are tori, such that every 
closed, orientable hyperbolic 3-manifold of volume at most V can be obtained by a Dehn filling of 
one of the Mj. 

The value of fi(3) is not known; the best known lower bound, due to Meyerhoff [19], is 0.104. . .. 
Marc Culler has informed me that the Margulis number for the Weeks manifold appears to be 
0.774 . . ., which is therefore an upper bound for /u(3). 
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Under suitable topological restrictions on a hyperbolic 3-manifold M, one can give lower bounds 
for the Margulis number that are much larger than 0.104. For example, it follows from the results 
of [11], combined with those of [9] or [2], that if M is closed and orientable and 7ri(M) has no 

2- generator finite-index subgroup, then log 3 is a Margulis number for M. 

The theme of the present paper is that new estimates for Margulis numbers can come from number- 
theoretical restrictions on a hyperbolic 3-manifold. Before explaining this further I must give a few 
definitions. 

Throughout this paper I will fix a surjective (continuous) homomorphism 11 : SL2(C) Isom-|_(]H'^) 
with kernel {±/}. If M is a closed hyperbolic 3-manifold, and if we write M = H^/T as above, 
then r = n~-^(r) is also uniquely determined by M up to conjugacy in Isom_|_(]HI^); in particular 
the set of all traces of elements of T is uniquely determined. According to [18, Theorem 3.1.2], 
these traces generate a finite extension of Q, called the trace field of M, or of T. 

I will say that M, or T, has integral traces if the traces of elements of T are all algebraic integers. 
For example, it follows from the proof of [23, Proposition 5] that if the closed orientable hyperbolic 

3- manifold M is not a Haken manifold, then M has integral traces. Note also that if M has integral 
traces then any finite-sheeted covering M of M also has integral traces (although M may well be 
a Haken manifold if M is non-Haken). 

The Margulis number of a hyperbolic 3-manifold M is a quantity associated with its geometric 
structure. In the previous literature, while number-theoretic invariants such as the trace field 
have interacted with topological properties of M such as its commensurability class [18], there 
seems to have few interactions if any between such number-theoretic invariants and quantitative 
geometric properties of M. In this paper I will show that restrictions on the trace field give stronger 
information than is otherwise available about the Margulis number and related quantities. 

I will illustrate this by stating the following four theorems, which are applications of the methods 
of this paper. The first two concern the case where the trace field is of low degree. 

Theorem 1.1. Let M be a closed orientable hyperbolic 3-manifold having integral traces, such that 
Hi{M]'Ls) = for s = 2,3,7. Suppose that the trace field ofV is a quadratic number field. Then 
0.3925 is a Margulis number for T. 

This will be proved in Section 9. 

Theorem 1.2. Let M be a closed orientable hyperbolic 3-manifold having integral traces, such that 
Hi{M]'Ls) = for s = 2,3,7. Suppose that the trace field of M is a cubic field. Then 0.3 is a 
Margulis number for M. 

This will be proved in Section 10. 

For examples of closed, orientable hyperbolic 3-manifolds having quadratic and cubic trace fields, 
see [18, Appendix 13.5]. 

The next two theorems concern trace fields of arbitrary degree. 

Theorem 1.3. If K d C is an arbitrary number field, there is a real number e = ex with < 
e < 0.3, having the following property. Let M be any closed hyperbolic 3-manifold which has trace 
field K, has integral traces and satisfies Hi{M; Z^) = for s = 2,3, 7. Let C be a primitive closed 
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geodesic in M having length I < e. Then there is an embedded tube about C having radius R, where 
R is defined by 



(In particular C is embedded.) 

Theorem 1.3 is closely related to Margulis numbers, although it does not explicitly involve them. 
Indeed, Proposition 11.4, which we state and prove in Section 11, implies that if M is a hyperbolic 
3-manifold having (log3)/3 as a Margulis number, then any closed geodesic of length I < (log3)/3 
in M is the core of an embedded tube of radius R, where R is defined by (1.3.1). Note that as 
/ —>■ the right hand side of (1.3.1) is asymptotic to A/l, where A = \/3(cosh((log 3)/3) — 1)/ (27r) = 
0.01869. . .. In contrast, if one uses Proposition 11.4 and the Margulis number 0.104 to obtain a 
lower bound R' for the embedded tube radius about a closed geodesic of sufficiently small length I, 
then sinh^ R' is asymptotic to A' /I, where A' = V3(cosh(0.104) - 1)/2tt = 0.00149 . . .. 

Alan Reid has pointed out to me that the methods of [1] may be used to show that certain number 
fields K have the property that for every e > there is a closed, orientable manifold having trace 
field K and containing a closed geodesic of length < e. In these cases the conclusion of Theorem 
1.3 is non- vacuo us. 

For the Margulis number itself, we have the following result. 

Theorem 1.4. Let K be a number field. Then up to isometry there are at most finitely many 
closed, orientable hyperbolic 3-manifolds with the following properties: 

(1) M has integral traces, and K is its trace field; 

(2) Hi{M; Zp) = /or p = 2, 3 and 7; and 

(3) 0.183 is not a Margulis number for M. 

Theorems 1.1 and 1.2 are applications of a general result which we state below as Theorem 1.6. 
Theorems 1.3 and 1.4 are not formally consequences of Theorem 1.6, but they rely on most of the 
ingredients in the proof of the latter theorem. The statement of Theorem 1.6 involves the following 
technical definition. 

Definition 1.5. Let O be the ring of integers in a number field. An element r of O will be called 
nifty if either 

(i) r and — 2 are both non- units in O, or 

(ii) T — 1 and r + 1 are both non-units in O. 

The element r will be called swell if (i) holds. 

Theorem 1.6 will be stated in terms of subgroups of SL2(C) rather than Isom+(]HI^); the transition 
between the statement and the applications to subgroups of Isom-|-(]HI^) will involve observations 
to be made in Subsection 2.4 below. 

Theorem 1.6. Let T < SL2(C) be a non- elementary torsion-free discrete group having integral 
traces, and suppose that HiiT; Zp) = for p = 2,3 and 7. Let O denote the ring of integers in the 
trace field ofV. Let x and y be non-commuting elements ofV. Suppose that each of the elements 



(1.3.1) 



sinh^ R 
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X and y is a power of an element of T whose trace is a nifty element of O. Then for any point 
P G H'^ we have 

max{d{P,x ■ P),d{P,y ■ P)) > 0.3. 

Furthermore, if each of the elements x and y is a power of an element of T whose trace is a swell 
element of O, then for any P we have 

max(ci(P, n(x) • P),d{P,U{y) ■ P)) > 0.3925. 

In Sections 9 and 10, Theorems 1.1 and 1.2 will be deduced from Theorem 1.6. Using observations 
made in 2.4 below, one can interpret Theorems 1.1 and 1.2 as saying that if K is either an imaginary 
quadratic field, or a cubic extension of Q which is not a subfield of R, and if T is a discrete torsion- 
free subgroup of SL2(C) whose trace field is K, then for any non-commuting elements x and y of 
r and for any P G H'^ we have 

(1.6.1) max(d(P, x ■ P), d{P, y ■ P)) > /i, 

where we set ^ = 0.395 in the quadratic case and = 0.3 in the cubic case. If K is quadratic and 
the traces of x and y are both swell, or if K is cubic and the traces of x and y are both nifty, this 
follows from Theorem 1.6. All the work in Sections 9 and 10 consists of classifying the non-swell 
elements in an imaginary quadratic field and the non-nifty elements in a cubic extension of Q which 
is not a subfield of R, and showing that when the trace of x or y is among these elements, the 
inequality 1.6.1 follows from either a direct geometric estimate or a variant of the proof of Theorem 
1.6. 

In Sections 11, Theorems 1.3 and 1.4 will be deduced from the methods used to prove Theorem 
1.6. The key ingredient in doing this is Theorem 11.2, which asserts that the ring of integers of 
an arbitrary number contains at most finitely many non-nifty elements. This will be in turn be 
deduced from a fundamental finiteness theorem for solutions to the 5-unit equation in a number 
field, due to Siegel and Mahler. 

The proof of Theorem 1.6 occupies Sections 3 — 8. Here, in order to indicate the basic idea of the 
proof, 1.6, I will give a sketch under a large number of simplifying assumptions. I will confine the 
discussion to the case in which trace x is swell, and in place of the hypothesis that i7i(r;Zp) = 
for p = 2, 3 and 7, I will make the stronger assumption that Hi^T] Z) = 0. It is easy to reduce the 
proof to the case where T < SL2(C'), where O is the ring of integers in a number field containing 
the trace field. The assumption that r is swell means that r and — 2 are non-units in O, so 
that there are prime ideals pi and p2 in O such that r G pi and — 2 G p2. For i = 1,2 let ki 
denote the finite field O/pj. As an additional simplifying assumption, I will limit the discussion to 
the case where ki and k2 are both of odd characteristic. 

The quotient homomorphism O ^ ki defines a homomorphism SL2(0) PSL2(/ci), whose restric- 
tion to r I will denote here by hi. Set Gi = hi{dr) and = hi(x), and let tj denote the trace of a 
representative of ^i in SL2(fej). Our choice of the pi guarantees that ti = and that t2 = 2. Using 
the assumption of odd characteristic one can deduce that the orders of ^i and ^2 in Gi and G2 are 
2 and 4 respectively. 

In particular the Gi are non-trivial. The assumption that HiiT-jZ) = then implies that the Gi 
are non-solvable. The non-solvable subgroups of SL2(fc), where A; is a finite field, were classified 
by Dickson (see [17, II. 8. 27], and in particular they are simple. The simplicity of the Gi, together 
with the information that ^1 and ^2 have different orders, allows one to deduce that the diagonal 
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homomorphism /i : 7 i— > (/ii(7), /i2(7)) is a surjection from F to G = Gi x G2. Note that ^ = 
{Cit^2) = h{x) G G has order 4. 

Dickson's classification given in [13] guarantees that the 2-Sylow subgroup Tj of each Gt is non- 
cyclic. This implies in turn that if T is a 2-Sylow subgroup of G then Hi{T;'Z2) has Z2-rank at 
least 4. Since has order 4, we may choose T to contain ^. Furthermore, if we set rj = h{y), we 
have rj^'^Tj^^ = 1 G T. Hence the finite-index subgroup T = h~^{T) of T contains x and yx^y~^ . 

The surjectivity of h implies that d\m.x2 T > dim^j 7" > 4. As T is the fundamental group of an 
orientable hyperbolic 3-manifold, one can then deduce, using [25, Proposition 1.1], that the two- 
generator subgroup (x,yx^y~^) has infinite index in T; and then, using [4, Theorem 7.1] and the 
assumption that x and y do not commute, that {x,yx^y~^) is a free group of rank 2. 

The freeness of {x, yx^y~^) can be used to prove the inequality max{d{P, x ■ P) , d{P, y-P)) > 0.3925, 
by means of [3, Theorem 4.1], which is in turn a consequence of the main theorem of [4] together 
with the lameness theorem of Agol [2] and Calegari-Gabai [9]. As a final simplification of the 
present sketch, I will limit myself to establishing the weaker lower bound of 0.375 for max((i(P, x ■ 
P),diP,yP)). 

Theorem 2.2 below, which is a special case of [3, Theorem 4.1], asserts that if X and Y are elements 
of Isom_(_(C) that generate a rank-2 free discrete group, we have 

1/(1 + exp((i(P, X ■ P))) + 1/(1 + exp(d(P, Y ■ P))) < 1/2 

for any P G M^. If we assume that max(d{P, x ■ P), d{P, y • P)) < 0.375, and take X = n(x) and 
Y = n(yx^y~^),the triangle inequality gives d{P, Y-P) < 6 x 0.375 and hence 

1 1^1 1 

l + exp{d{P,x- P)) l+exp{d{P,yx*y-^ ■ P)) " 1 + exp(0.375) 1 + exp(6 x 0.375) 

= 0.502 . . . , 

a contradiction. This completes the sketch. 

Sections 3 and 4, following the preliminary Section 2, are largely devoted to general algebraic 
background for the technical arguments in Sections 5 and 6, which are refinements of the algebraic 
steps described in the sketch above. The main results of these sections are Lemmas 5.8 and 
5.9, which concern finite-index subgroups of general subgroups of SL2(C') where O is the ring of 
integers in a number field, and Propositions 6.3 and 6.4, which adapt the results of Section 5 to 
the 3-manifold situation. Most of Sections 7 and 8 are devoted to the spherical geometry and 
hyperbolic trigonometry needed to refine the application of the triangle inequality in the sketch 
above. 

I am grateful to Marc Culler, Jason DeBlois, Ben McReynolds, Alan Reid and Steven Smith for 
helping to make this a better paper. Smith helped me with the finite group theory needed in 
Sections 3 and 5. It was Reid who told me about Nagell's paper [20], which halved the amount of 
calculation needed for the results in Section 10, and about the result of Siegel and Mahler which 
made the results of Section 11 possible. 

2. Conventions and preliminary observations 

2.1. I will say that two elements of a group T are independent if they generate a rank-2 free 
subgroup of r. 
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As in the introduction, hyperbolic distance in H'^ will be denoted by d. On the other hand, I will 
denote by ds the spherical distance function on S'^, in which the distance between antipodal points 
is equal to vr. 

The following result, which was mentioned in the Introduction, is crucial for all the main results of 
this paper. 

Theorem 2.2. Let P be a point o/H^, and let X and Y be independent elements o/Isom-|-(C) that 
generate a discrete group. Then we have 

(2.2.1) 1/(1 + exp(d(P, X ■ P))) + 1/(1 + exp(fi(P, Y ■ P))) < 1/2. 



Proof. This is the case /c = 2 of [3, Theorem 4.1]. □ 

2.3. As was already mentioned in the introduction, I will fix throughout the paper a surjective 
(continuous) homomorphism 11 : SL2(C) — > Isom+(E[^) with kernel {ibid}. The natural action of 
Isom+(]HI'^) on H'^ pulls back via 11 to an action of SL2(C) on H^. Both these actions will be denoted 
by [g, z) g ■ z. In particular we have n(A) ■ z = A - z for any A G SL2(C) and any z G H^. 

2.4. If r < SL2(C) is cocompact and torsion-free, then 11 maps T isomorphically onto a cocompact 
(and torsion-free) subgroup of Isom+(EI^). Conversely, it follows from [10, Proposition 3.1.1] that 
any cocompact torsion-free subgroup of Isom-|-(]HI'^) is the isomorphic image under 11 of a cocompact 
(and torsion-free) subgroup of SL2(C). 

2.5. I will denote the complex length of a loxodromic isometry g G Isom+(]HI^) by Clength g. Recall 
that Clength 5( is a complex number defined modulo 27ri and having positive real part. If 2; is a 
point of H^, and if we set D = dist{z,g • z), then the distance from z to the axis of g is equal to 
u;(Clength (7, D), where u is the function defined by 

/^^-.N /, ■ , / /cosh I) — cosh Z \ 

(2.5.1) uj(l + i9, D) =aicsmh{ { — — 

V V cosh I — cos 6 J 

whenever D > I > and 9 G H. The formula (2.5.1) shows that for any 6 and any / > 0, the 
function uj{l,9, ■) is a continuous, monotonically increasing function on (/,oo). 

11 A £ SL2(C) and trace j4 G [—2, 2], the trace of A and the complex length of n(A) are related by 

(2.5.2) trace A = ±2 cosh(Clength(n(yl)) /2). 

(Note that the expression 2 cosh(Clength(n(^)))/2 is defined only up to sign since Clength(n(^)) 
is defined modulo 27ri.) 

The translation length of a loxodromic isometry g is denoted lengthy. It is equal to the real part 
of Clength g. 

2.6. The cardinality of a finite set S will be denoted by \S\. 

2.7. If is a number field, its ring of integers will be denoted by Oe- 

If K and E are number fields, with K C E, we have Ok = Oe H K. In view of Definition 1.5, it 
follows that an element of Ok is nifty (or swell) in Ok if and only if it is nifty (or, respectively, 
swell) in Oe- 
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2.8. Expressions in decimal notation are to be taken literally. For example, if x is a real number, 
to write x = 0.39 means that x is the rational number 39/100. To write x = 0.39 . . . means that x 
lies in the interval [39/100,40/100]. With this understanding, expressions in decimal notation may 
legitimately be used in proofs. For example, from x = 0.395 ... we may deduce that = 0.156 . . .; 
but from from x = 0.375 ... we may deduce only that x^ = 0.14 . . .. 

3. Some properties of SL2(Fg) 

Lemma 3.1. Let k be a finite field, let p denote its characteristic, let g be an element o/SL2(/c), 
set t = trace g £ k, and let m denote the order of g. 

(1) lft = then m divides 4. More specifically, in this case we have m = 4 if p ^ 2, and m < 2 
ifp = 2. 

(2) If = 2 then m divides 8, and is equal to 8 if p ^ 2. 

(3) If t = —1 then m divides 3, and is equal to 3 if p ^ 3. 

(4) If t = 1 then m divides 6. More specifically, in this case we have m = Q if p is not equal to 
2 or 3; we have m = 3 if p = 2; and we have m = 2 or m = 6 if p = 3. 

Proof. The characteristic polynomial of g is X'^ — tX + 1. By the Cayley-Hamilton theorem we 
have 

(3.1.1) g'^ -tg + id = 0. 

If t = then (3.1.1) gives g'^ = —id. Hence g'^ = id, and g^ = id if and only ii p = 2. This proves 
(!)• 

If t'^ = 2 then g'^ + id = [g"^ — tg + id){g'^ + tg + id), which in view of (3.1.1) implies that g'^ = —id. 
Hence g^ = id, but g"^ ^ id unless p = 2. This proves (2). 

If t = — 1 then g^ — \d = {g — id)(gi^ — tg + id), which in view of (3.1.1) implies that g^ = id. Since 
the identity has trace 2, we cannot have g = '\d unless p = 3. This proves (3). 

Now suppose that t = 1. We have g ^ \A in this case, since 2 is not congruent to 1 modulo any 
prime. On the other hand we have g^ + \d = {g + id)((7^ — tg + id), which in view of (3.1.1) implies 
that g^ = —id. Hence g^ = id, and g'^ = id if and only if p = 2. Furthermore, if g^ = id, it follows 
from (3.1.1) and the assumption t = 1 that g = 2id; taking traces of both sides we obtain 1 = 4 G A; 
and hence p = 3. This proves (4). □ 

Definitions 3.2. Let A; be a finite field, and let G be a subgroup of SL2(fc). We shall say that G is 
potentially triangularizable if there is a finite extension / of k such that G is conjugate in GL2(/) to 
a group of upper triangular matrices in SL2(/). A subgroup of PSL2(A;) will be said to be potentially 
triangularizable if its pre-image under the natural homomorphism SL2(A;) PSL2(A:) is potentially 
triangulariz able . 

Proposition 3.3. Let k be a finite field and let G be a subgroup ofPSh2{k). Suppose that Hi(G; Z2) 
and Hi{G]'L^) are trivial, and that G is not potentially triangularizable. Then at least one of the 
following alternatives holds: 

(i) G is conjugate in PGL2(/c) to PSL2(/i;o); for some subfield kQ of k; or 
(a) G is isomorphic to the alternating group A5. 
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Proof. It follows from the proof of [17, II. 8. 27] that for any finite subgroup G of PSL2(A;), either 
(i) or (ii) holds, or G is potentially triangularizable, or G is isomorphic to A4, or S5 or a dihedral 
group D2n- But 7/1(^4; Z3), Hi{S^]'L2) and Hi{Dn;'^2) are non-trivial. □ 

Corollary 3.4. Let k be a finite field and let G be a solvable subgroup o/SL2(/c). Suppose that 
Hi{G;7j2) and ffi(G;Z3) are trivial. Then G is potentially triangularizable. 

Proof. Suppose that G is not potentially triangularizable. Let G denote the image of G under the 
quotient map SL2(A;) — > PSL2(fc). Then G is not potentially triangularizable. Hence one of the 
alternatives (i) or (ii) of Proposition 3.3 holds. If (i) holds and |A;o| > 3 or if (ii) holds, then Go is 
a non-abelian simple group and the solvability of G is contradicted. If (i) holds and |fco| < 3, then 
either Hi(G;7j2) or Hi{G]'L-i) is non-trivial, and we have a contradiction to the hypothesis that 
Hi{G; Z2) and Hi{G; Z3) are trivial. □ 

Notation 3.5. If G is a finite group and £ is a prime, I will denote by cre{G) the dimension of the 
Z^- vector space Hi(T; Z^), where T is the -£-Sylow subgroup of T/N. 

Lemma 3.6. Let k be a finite field of odd characteristic, and let G be a subgroup o/SL2(A;). Suppose 
that Hi(G;7j2) and Hi{G]'L^) are both trivial, and that G is not potentially triangularizable. Then 
G has the following properties: 

(1) The center Z of G has order 2. 

(2) G/Z is a non-abelian simple group. 

(3) The only normal subgroups of G are G, Z and the trivial subgroup. 

(4) Every exact sequence of the form 

{id} — . C — ^ G — ^ G — > {id}, 

where G is a cyclic group of order 2 and G is an arbitrary finite group, is split. 

(5) For every prime I we have a^{G) = aii[G/Z). 

(6) If X is any element of G and if m denotes its order, then the order of xZ in G/Z is equal to 
m if m is odd, and to m/2 if m is even. 

(7) The order of G/Z is divisible by 6. 

Proof. Let P : SL2(A;) PSL2(A;) denote the quotient map, and set G = P{G). By Lemma 3.3 
we may assume that either G = PSL2(A:o) for some subfield ko of A;, or G = A5. In the case 
G = PSL2(A;o), let q denote the order of ko; we have |G| = {q^ — q)/2. Hence |G| is divisible by 6 
in this case. In the case G = ^5, we have |G| = 60. Hence: 

3.6.1. |G| is divisible by 6. 

To prove the remaining assertions, we first observe that —id is the only element of order 2 in SL2(A;), 
and hence that: 

3.6.2. Every even-order subgroup of Sh2{k) contains —id. 

Since G has even order by Assertion (7), it follows from 3.6.2 that the order-2 subgroup (—id) of 
SL2(A:) is contained in G. Note that (—id) is central in SL2(A;) and is therefore contained in the 
center Z of G. Furthermore, (—id) is the kernel of P\G. Hence G/(— id) = G. 

Since Hi{G;Z2) and Hi{G]'L^) are both trivial, -ffi(G;Z2) and Hi{G]'L^) are also trivial. In the 
case G = PSL2(A;o)5 it follows that g > 4, and hence that G is simple and non-abelian. Since the 
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only other case is G = ^5, it follows that G is always a simple non-abelian group. Since (—id) < Z, 
and since id) = G in particular has a trivial center, it follows that (—id) = Z. This establishes 
Assertions (1) and (2). 

Since we have shown that G/Z = G, Assertion (7) now follows from 3.6.1. 

Now let X be any element of G, and let m denote its order. If m is even, we have Z = (id) < {x) 
by 3.6.2. If m is odd then (x) f] Z = {id} since Z has order 2. This proves Assertion (6). 

If is any normal subgroup of G, then NZ is a normal subgroup of G, which is simple. Hence either 
NZ is trivial, in which case N is either Z or the trivial subgroup, or NZ = G. In the latter case, 
it follows from Assertion (7) that N has even order, and hence by 3.6.2 we have Z = (—id) < N. 
Combining this with NZ = G we deduce that = G in this case. This proves Assertion (3). 

For an odd prime i, Assertion (5) follows from Assertion (1). To prove Assertion (5) for ^ = 2, we 
note that G/Z has an element of even order by Assertion (7), and hence by Asssertion (6) that G 
has an element of order 4. If T denotes a 2-Sylow subgroup of G, it follows that T has an element of 
order 4. Since the generator —id of Z is the unique element of order 2 in G, we deduce that —id is 
a square in T. This implies that the natural homomorphism Hi(T) Hi{T /Z) is an isomorphism. 
As T/Z is the 2-Sylow subgroup of G/Z, this shows that (T2(G) = (72 (G/Z), and Assertion (5) is 
established. 

It remains to prove Assertion (4). Suppose that 

(3.6.3) {id} >C >G-^G > {id} 

is an exact sequence, where G is a finite group and G is cyclic of order 2. Then Z = p^^{Z) is normal 
in G; furthermore, Z has order 4 and is therefore abelian. Hence the action of G on Z by conjugation 
defines an action of G/Z on Z, which is described by a homomorphism a : G/Z Aut(Z). But 
G/Z is isomorphic to G/Z and is therefore simple and non-abelian by Assertion (2). Since |G| = 4, 
the group Aut(Z) is isomorphic to a subgroup of 53 and is therefore solvable. Hence a is the 
trivial homomorphism. This means that Z is central in G, so that G is a central extension of 
G/Z ^ G/Z ^ G. 

In the notation of [15], G is isomorphic either to ^5 = Ai{4) or to PSL2(A;o) = ^i(l^ol)) where 
is a field of order at least 4. It therefore follows from [15, Theorems 6.1.2 and 6.1.4] that the Schur 
multiplier H'^{G;C*) is isomorphic to Z2. Hence every perfect central extension of G has order 
at most 2|G| ([5, (33.8)]). Since |G| = 4|G|, it follows that G is not perfect. Thus there exists a 
surjective homomorphism (3 : G ^ A for some non-trivial abelian group A. Set = ker (3. Then 
N = a{N) is normal in G, and by Assertion (3) we have either N = G or \N\ < 2. 

If |A^| < 2 then |A^| < 4, and so N is abelian. Since G/N = A is also abelian, G is solvable, which 
is impossible since it admits a surjective to the simple non-abelian group G. Hence N = G, i.e. p 
maps N onto G. But |G| = 2|G|, and so if p| : ^ G had a non-trivial kernel, we would have 
N = G; this is impossible because G/N is isomorphic to the non-trivial group A. Hence p maps A^ 
isomorphically onto G. Thus the sequence (3.6.3) splits, and Assertion (4) is proved. □ 

Proposition 3.7. Let k be a finite field, and let G < PSL2(A;) be a subgroup. Suppose that G is 
not potentially triangularizable, and that H\{G]'L2) and Hi{G]'L^) are trivial. Then 0"2(G) > 2. 

Proof. Let T denote the 2-Sylow subgroup of G. We are required to show that dim^j Hi{T] Z2) > 2. 
Since T is 2-nilpotent, it suffices to show that T is non-cyclic. 



MARGULIS NUMBERS AND NUMBER FIELDS 



10 



Since Hi{G]'L2) = and G is not potentially triangularizable, it follows from Lemma 3.3 that G is 
isomorphic either to PSL2(feo) for some finite field /cq, or to the alternating group ^5. In the latter 
case, T is isomorphic to Z2 x Z2. 

Now suppose that G = PSL2(A;o) for some Ajq. If has characteristic 2 then T is isomorphic to the 
additive group of ko, which is non-cyclic unless |A;o| = 2. However, in the latter case G is isomorphic 
to PSL2(F2), and we have Hi{G;Z2) 7^ 0, a contradiction to the hypothesis. 

There remains the case that ko has characteristic p > 2. In this case we set q = \ko\, and we 
consider the sets 

S = {x^ : X G k(j} C ko 

and 

T = {-1 -x'^ : X G ko} C ko- 
We have |5| = \T\ = (g + l)/2. Hence |5| + |T| > q, so that SnT / 0. We fix an element c E SnT, 
and we fix a, 6 G ko such that = c and —1 — = c. Then + 6^ = —1, and hence the matrix 

a b\ 
b -aj 



M 




belongs to SL2(A;o). We set 



The elements A and M of SL2(/co) have trace and hence have order 4 by Lemma 3.1. By direct 
calculation we find that AM = —MA. a,b € ko- If tt : SL2(/co) ~^ PSL2(A;o) denotes the natural 
homomorphism, it follows that tt{A) and 7r(M) commute. On the other hand, A and M do not 
commute since p ^ 2. If we had 7r(M) = tt^AY for some e G {1, —1}, we would have M = ztA*^, and 
A and M would commute, a contradiction. Hence the subgroup of PSL2(A;o) generated by vr(^) and 
7r{B) is isomorphic to Z2 x Z2, and the 2-Sylow subgroup of PSL2(A;o) is therefore non-cyclic. □ 



4. Congruence kernels 



Definition 4.1. A normal subgroup of a group T will be termed cosolvable if T /N is solvable. 

Notation 4.2. Let E he a number field and let 3 be an ideal in Oe- I will denote by Rj the ring 
Oe/'^- If ^ is a prime ideal then Rf^ is a finite field which I will denote by /csp. 

If '3 is any ideal in Oe, I will denote by 773 : Oe R3 the quotient homomorphism. I will denote 
by /13 the natural homomorphism GL2(C'£;) — > GL2(i?o), defined by 

[c d) \rj^{c) mid)) 

for (l ^) . SL2(C^.). 

Definition 4.3. Let T be a group and let p be a prime number. A subgroup A of T will be called 
a characteristic-p congruence kernel if there exist a number field E, an injective homomorphism p : 
r — > SL2(C'£;) < GL2{Pe), and a prime ideal *P in Oe, such that char kep = p and A = ker(/i(p o p). 

A subgroup A^ of a group T will be called a congruence kernel if it is a characteristic-p congruence 
kernel of T for some prime p. I will call A an odd- characteristic congruence kernel if it is a 
characteristic-p congruence kernel for some odd prime p. 
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Note that a congruence kernel in F is a normal subgroup of finite index in T. 

4.4. Let G be a group, and let p be a prime number. Recall that the mod p lower central series 
(A„(G,p))„,>i is the sequence of subgroups defined recursively by setting Xi{G,p) = G, and defining 
Xn-\-i{G,p) to be generated by all p-th powers in A„(G,p) and all commutators of the form [x,y] 
with X £ G and y G Xn{G,p). We call X2{G,p) the mod p commutator subgroup of G. Recall 
that G is p-nilpotent if and only if Xn{G,p) = {id} for some n, and that G is said to be residually 
p-nilpotent if Cl^yi ^n{G,p) = {id}. 

Proposition 4.5. Let p be a prime, and suppose that N is a characteristic-p congruence kernel in 
some group T. Then N is residually p-nilpotent. 

Proof. It suffices to prove that if is a number field and is a prime ideal in Oe, then the kernel 
of /isp : GL2{Oe) GL2(A;(p) is residually p-nilpotent, where p = char A;<p. 

For each n > 1, let Gn denote the kernel of /i(pn : GL2(0£;) GL2(i?<pn). Then Gi/Gn is 
isomorphic to the kernel Kn of the natural homomorphism GL2(-R(p") GL2(fc(p). 

If Mn denotes the ring of 2 x 2 matrices over R^^n then Kn C id + ^PM„. On the other hand, for 
any g G \d + ^M„, we have det g = 1 (mod Since R'^n is a local ring with maximal ideal 
^i?>pn it follows that detg is invertible in i^sp™ and hence that g G GL2(i?<p"). This shows that 
id + *PM„ C GL2(iiq3n), from which it follows that K„ = id + *PM„. Hence = |*PM„|. 

The local ring iJqjn has residue field /ctp of characteristic p; hence |-R<pn| is a power of p. Since 
is a submodule of M„, which is a rank- four free module over i?(pi, the order of is also 

a power of p. This shows that Kn is a finite p-group, and is therefore p-nilpotent. Hence Gi/Gp is 
p-nilpotent, and so for some jn we have 

(4.5.1) A,„(Gi,p). 

Finally, since 0^=1 = {0}, we have (1^=1 = {id}. Hence it follows from (4.5.1) that 
n^iAj„(Gi,p) = {id}, which implies that Gi is residually p-nilpotent, as required. (This ar- 
gument is similar to the discussion on p. 87 of [14]). □ 

Lemma 4.6. Let T be a finitely generated subgroup of SL2(C'_b) which has no abelian subgroup 
of finite index. Let p be a prime, and let N be a characteristic-p congruence kernel in F. Then 
dimz^Hi{N;Zp)>2. 

Proof. According to Proposition 4.5, N is residually p-nilpotent. Let N = Ni > N2 > • • • de- 
note its mod p lower central series. Assume that the conclusion of the lemma is false, so that 
dimzp LLi{N; Zp) < 1. Then for each n > 1, the group N/Nn is p-nilpotent, and dim^^ Hi{N/Nn) < 
1. It follows that N/Nn is cyclic, and in particular abelian, for every n. Hence for any elements 
x,y £ N, the commutator [x, y] has trivial image in N/Nn for every n; that is, [x, y] lies in PlJ^i ^n, 
which is trivial by residual p-nilpotence. This shows that N is abelian. But has finite index in 
F since F is finitely generated, and we have a contradiction to the hypothesis. □ 

Notation 4.7. Let 7 be an element of a group F, and let A be a finite-index normal subgroup of 
F. We will denote by 7ti-7v(7) the order of xN in T/N. 

Lemma 4.8. Let p be a prime, and let N be a characteristic-p congruence kernel in a group F. 
Let an element 70 G F 6e given, and set m = m]y{'yo). Suppose that 
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(1) m > 1, and m ^ p; and 

(2) Hi(T; Z,) =0 for s = 2,3 and m. 

Then N is not cosolvahle. 

Proof. Assume that N is cosolvable. Since is a characteristic-^ congruence kernel, T/N is 
isomorphic to a subgroup G of SL2(A;) for some field k of characteristic p. Since ffi(r;Z2) and 
HiiT]'L^) are trivial, so are Hi(G;7j2) and Hi{G]'L^). The cosolvability of N implies that G 
is solvable. By Corollary 3.4 it follows that G is potentially triangularizable, and is therefore 
isomorphic to a group of upper triangular matrices in SL2(A;') for some finite field k' . Hence there 
is a short exact sequence 

>U ^G^Q ^0, 

where Q is isomorphic to a subgroup of the multiplicative group of /c', while U is isomorphic to 
a subgroup of the additive group of k\ which has exponent p. Since i^i(r;Zm) = we have 
Hi(G;7jm) = 0, and hence the order of Q is relatively prime to m. But by the definition of 
m = mAr(7o), the order of go in G is m, and we must therefore have (3{go) = 0. Hence go is in the 
image of a, so that U has an element of order m. This implies that m\p, which contradicts the 
hypothesis. □ 

5. Constructing useful quotients 

The main results of this section are Propositions 5.8 and 5.9, which constitute the main algebraic 
steps in the proof of Theorem 1.6. 

Notation 5.1. Let be a finite-index normal subgroup of a group F, and let £ be a prime. I will 
denote by yi{N) the number a£(T/N). I will denote by ne{N) the dimension of the Z^-vector space 

Lemma 5.2. Let Ni and N2 be finite-index normal subgroups of a group T, and let u be an element 
ofV. Suppose that 

• Ni is a non- CO solvable odd- characteristic congruence kernel; 

• N2 is either a cosolvable normal subgroup of odd index or a non- cosolvable odd- characteristic 
congruence kernel; 

• mN^{u) 7^ mjv2(u); and 

. i7i(r;Z2) =i?i(r;Z3) =0. 

For i = 1,2, set 9i = m^.^u) if mi^.{u) is odd, and 6i = m]sf.{u)/2 if ■mf^-{u) is even. Then there 
exists a finite-index normal subgroup N inV such that 

(1) vi{N) = U(_{Ni) + ve.{N2) for every prime i, and 

(2) mi\j{u) is the least common multiple of 9i and 62. 

Proof. For i = 1,2 we set Gi = T/Ni and gi = uNi £ Gi. We define a homomorphism /i : L — > 
Gi X G2 by setting h{-/) = {-/Ni,jN2) for every 7 G L. We set W = h{T) < Gi x G2. For i = 1,2, 
if vTj denotes the projection from Gi x G2, it is clear that TTi\W : W ^ Gi is surjective. 

Since A^i is a characteristic-pi congruence subgroup of F, the group Gi is isomorphic to a subgroup 
G'l of SL2(A;i), for some finite field ki of characteristic pi. Since A^i is not cosolvable, G'l is not 
potentially triangularizable. 
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For s = 2 and 3, since Hi^T] TLg) = 0, we have H\{G\\ TLs) = and hence KxiG'^^TLs) = 0. Hence the 
conclusions of Lemma 3.6 hold with G\ playing the role of G. In particular, according to conclusion 
(1) of Lemma 3.6, the center of Gi, which we will denote by Zi, has order 2. 

Set J = n {G\ X {id}). Then J is the kernel of the surjective homomorphism 7r2\W : W G2. 
Hence we have an exact sequence 

(5.2.1) {id} >J >w"-^G2 ^{id} 

We claim: 

5.2.2. J is a normal subgroup of Gi x {id}. 

To prove 5.2.2, we consider an arbitrary element x of J and an arbitrary element z of Gi x {id}. 
Let us write x = (^,id) and z = (Ciid), where x,z G Gi. Since 7ri\W : W ^ Gi is surjective, 
there is an element y W such that iTi{y) = (. We may write y = (d^^) for some v G G2- Since 
X £ J C W and y £ W, we have yxy^^ G W. But 

yxy~' = {C,v){C,id){C\v-') 
= zxz~^, 

from which it follows that zxz~^ belongs to W and hence to J. This proves 5.2.2. 

In view of 5.2.2 and conclusion (3) of Lemma 3.6, J must be equal to Gi x {id} or Zi x {id} or 
{id} X {id}. Hence: 

5.2.3. Either J = Gi x {id} or \J\ < 2. 
We claim that, in fact, we have 

(5.2.4) J = Gix{id}. 

I will prove (5.2.4) by assuming that | J| < 2 and deriving a contradiction; (5.2.4) will then follow 
in view of 5.2.3. 

According to the hypothesis, N2 is either cosolvable or an odd-characteristic congruence kernel. If 
I J| < 2, and if N2 is cosolvable (so that G2 is solvable), then the exactness of (5.2.1) implies that 
W is solvable. Since vri 1 1^ : — > Gi is surjective, it follows that Gi is solvable. But according to 
conclusion (2) of Lemma 3.6, Gi has a quotient which is a non-abelian simple group. This is the 
required contradiction in this case. 

Now suppose that |J| < 2, and that is an odd-characteristic congruence kernel which is not 
cosolvable; thus G2 is non-solvable and is isomorphic to a subgroup G2 of SL2(A:2) for some finite 
field /c2- Since G2 is non-solvable, G2 is not potentially triangularizable. For s = 2,3, since 
HiiT-jZs) = 0, we have Hi{G2]'^s) = and therefore Hi{G'2]'^s) = 0. Hence the conclusions of 
Lemma 3.6 hold with G2 playing the role of G. 

If I J| = 2, we may apply conclusion (4) of Lemma 3.6, with W and G2 playing the respective roles 
of G and G, to deduce that the exact sequence (5.2.1) splits. In particular we have Hi(yV] Z2) 7^ 0. 
This is a contradiction, since W = h(r) and Hi^T] Z2) = by hypothesis. 

If I J| = 1, the exactness of 5.2.1 implies that 7r2\W is an isomorphism from W to G2. Let u; : 
G2 ^ W denote the inverse isomorphism, and set /3 = vri o a; : G2 ^ Gi. Since 7ri|VF : — > Gi 
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is surjective, /? is also surjective. Note that by the definition of the homomorphism h we have 
■K2{h{u)) = uN2 = 92, and hence oj[g2) = h{u). Hence 

/3(fi'2) = TTi{u{g2)) = 7ri(/i(u)) = uNi = gi. 

Since by hypothesis the orders niN^{u) and mN^{u) of gi and g2 are not equal, j3 cannot be an 
isomorphism. 

Let L 7^ {id} denote the kernel of (3. Applying Conclusion (3) of Lemma 3.6, with G2 playing the 
role of G, we find that either L = G2 or L is the center of G2. If L = G2 then Gi is trivial; this 
is impossible because by hypothesis A^i is not cosolvable, i.e. Gi is not solvable. Finally, if L is 
the center of G2 then Conclusion (2) of Lemma 3.6, applied with G2 in the role of G, implies that 
Gi = G2/L is a non-abelian simple group. But Conclusion (1) of Lemma 3.6, applied with Gi in 
the role of G, implies that Gi has a non-trivial center. Again we have the required contradiction. 
This proves 5.2.4. 

We now claim: 

5.2.5. W = Gi X G2, i.e. h is surjective. 

To prove 5.2.5, let an element g = ((71, 52) of Gi x G2 be given. Since '7T2\W : — > G2 is surjective, 
there is an element uofW such that iri{u) = g2- We may write u = (0,52) for some a G Gi. Then 
gu~^ = (5'ia~^,id) G Gi x {id}, and it follows from 5.2.4 that gu~^ e J CW. Since u G W it 
follows that g £ W, and 5.2.5 is established. 

We now claim: 

5.2.6. For each i £ {1,2} there exist a group Gi and a homomorphism Pi : Gi ^ Gi such that 
Pi{gi) has order 6i, and such that ai{Gi) = (Jt{Gi) for every prime I. 

To prove 5.2.6, we distinguish two cases. By hypothesis, either Ni is a non-cosolvable congruence 
kernel, or i = 2 and is a cosolvable normal subgroup of odd index. If A', is a non-cosolvable 
congruence kernel then for some finite field ki, the group Gi is isomorphic to a subgroup G^ of 
SL2(fci) which is not potentially triangularizable. For s = 2,3, since Hi^T^'Ls) = 0, we have 
-ffi(G^;Zs) = 0. Hence the conclusions of Lemma 3.6 hold with Gj playing the role of G. If we set 
Gi = GijZi^ where Zi denotes the center of Gj, and define Pi : Gi ^ Gi to be the quotient map, 
then the assertions of 5.2.6 follow from Assertions (5) and (6) of Lemma 3.6. 

On the other hand, if i = 2 and N2 is a cosolvable normal subgroup of odd index, we may set 
G2 = G2, and define P2 : G2 — > G2 to be the identity map. In this case, the order m]\f^{u) of 
g2 = P2{g2) is odd and hence equal to 62; and it is trivial that ai{Gi) = ai{Gi). Thus 5.2.6 is 
proved. 

Now fix groups Gi and G2, and homomorphisms Pi and P21 having the properties stated in 5.2.6. 
Set P = Pi X P2 : Gi X G2 ^ Gi X G2. Since /i : F ^ Gi x G2 is surjective, so is Po/i : F ^ Gi x G2. 
Hence if we set N = ker(P oh), we have T/N = Gi x G2. 

If ^ is a prime and denotes the ^-Sylow subgroup of Gi, then Ti x T2 is the £-Sylow subgroup of 
Gi X G2. Hence ai{Gi x G2) = cr^CGi) -|- cr£(G2)- In view of 5.2.6 it follows that 

at{Gi X G2) = (T£(Gi) + (Ji{G2), 

and hence 
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This gives conclusion (1) of the lemma. 

Finally, niN^u) is the order of P o h(u) in Gi x G2- Since P o h{u) = {Pi{gi) , P2ig2)) , and since 
Pi{gi) has order 6i in Gi) by 5.2.6, the order of P o h{u) in Gi x G2 is the least common multiple 
of 61 and ^2- This gives conclusion (2) of the lemma. □ 

Lemma 5.3. Let Ni and N2 be odd- characteristic congruence kernels in a group T, and let u be 
an element ofT. Suppose that 

• neither Ni nor N2 is cosolvable; 

• ruNii'-fo) / niN^i'lo); and 

. Hi{T;Z2) = Hi{T;Z3) = 0. 

For i = 1,2, set Oi = mAr,(u) if mi^.(u) is odd, and 9i = mjv.(n)/2 if m]\j-{u) is even. Then there 
exists a finite-index normal subgroup N inV such that 

(1) V2{N) > 4, and 

(2) mN{u) is the least common multiple of 9i and 62- 

Proof The hypotheses of the lemma immediately imply those of Lemma 5.2. Hence there exists 
a finite-index normal subgroup of L such that conclusions (1) and (2) of Lemma 5.2 hold. 
Conclusion (2) of Lemma 5.2 is conclusion (2) of the present lemma. 

For i = 1,2, the group T/Ni is isomorphic to a subgroup Gi of SL2(A;j) for some finite field ki. 
Since Ni is not cosolvable, Gi is not potentially triangularizable. Since Hi(T;7j2) = 0, we have 
Hi{Gi]'L2) = 0. Hence it follows from Proposition 3.7 that V2{Ni) = o"2(Gi) > 2 for i = 1,2. 
Conclusion (1 ) of Lemma 5.2 gives 

V2[N) = V2[Ni) + U2{N2) > 4, 
so that conclusion (1) of the present lemma holds. □ 

Lemma 5.4. Let T be an infinite, finitely generated group, let p be a prime, and let d > 2 be an 
integer. Let N be a characteristic-p congruence kernel in T. Suppose that N is cosolvable. Set 
G = T/N. Set 

S = {2,3,p}U{/-l : 1 <r <(i}, 

and suppose that Hi{r,Zs) = for every s £ S. Then either 

(i) the p-Sylow subgroup T of G is an elementary abelian p-group of rank at least d, normal in 

G, and G/T is a cyclic group whose order is not divisible by 2, 3 or p; or 
(a) G is a cyclic group whose order is not divisible by 2, 3 or p, and Kp{N) > d. 

Proof. Since N \s& characteristic-p congruence kernel, V /N is isomorphic to a subgroup G of SL2(A;) 
for some finite field k of characteristic p. Since N is cosolvable, G is solvable. Furthermore, since 
i/i(r,Zs) = for every s E 5, we have we have Hi{G;Zs) = and Hi{G;Zs) = for every s £ S. 
Hence Corollary 3.4 implies that G is potentially triangularizable. It follows that there exist a finite 
extension k' of k, a unipotent subgroup T <\ G, isomorphic to a subgroup of the additive group of 
k', such that Q = G/T is isomorphic to a subgroup of the multiplicative group of k' . In particular, 
T is an elementary abelian p-group, Q is cyclic, and p does not divide the order of Q. Hence T is 
the p-Sylow subgroup of G. 
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Since Q is a homomorphic image of G and hence of F, we have Hi(Q; Zg) = for every s E S*. In 
particular i?i(Q;Zp) = 0, and Hi{Q;7jpr_i) = for every strictly positive integer r < d. Since Q 
is a finite cyclic group, it follows that: 

5.4.1. \Q\ is relatively prime to every integer in the set S. 

We define a group G and a subgroup of G as follows. If the group T is non-trivial, we set G = G 
and W = T. If T is trivial, we define G to be T/N', where A^' denotes the mod p commutator 
subgroup (see 4.4) of N, and we set W = N/N' < G. In both cases, we claim: 

5.4.2. The group G is finite and W is a non-trivial elementary abelian p-group. Furthermore, W 
is normal in G, and G/W = Q. 

To prove 5.4.2, we first consider the case in which T is non-trivial. In this case G = G is isomorphic 
to a subgroup of SL2(A;') and is therefore finite, and we have W = T. By the discussion above, T 
is an elementary abelian p-group and is normal in G. Since we are in the case where T ^ {id} we 
have W ^ {id}. Furthermore, we have G/W = G/T = Q hy definition. Thus 5.4.2 is true in this 
case. 

Now consider the case in which T is trivial. Since N has finite index in the finitely generated group 
r, it is itself finitely generated; hence the mod p commutator subgroup A^' has finite index in 
and therefore in T, and so G = T/N' is finite. Since W = N/N' is finite and is the quotient of 
by its terms of the mod-p commutator subgroup, W is an elementary p-group. Furthermore, A^ is a 
non-trivial group because F is infinite, and A^ is residually p-nilpotent by Proposition 4.5. It follows 
that the mod p commutator subgroup A^' is a proper subgroup of A^, so that W is non-trivial in 
this case as well. The normality oiW = N/N' in G = T/N' follows from the normality of A^ in F. 
Furthermore, we have G/W = {T/N') /{N/N') = T/N = G, and G = Q since T is trivial. Thus 
5.4.2 is proved. 

We let r denote the rank of the elementary abelian p-group W. By 5.4.2 we have r > 0. We claim 
that 

(5.4.3) r>d. 

To prove (5.4.3) we first note that since W is abelian, the action of G by conjugation on <l G 
induces an action olG/W on W . This action is described by a homomorphism a : G/W — > Aut(I^). 
Since Aut(T^) ^ GLr(Fp), we have 

(5.4.4) |Aut(I^)| = (p'^-l)(/-p)---(p"-p"-i). 

Assume that r < d. Then it follows from (5.4.4) that |Aut(VF)| is a product of integers in the set 
S. It therefore follows from 5.4.1 that |G/Vl^| = \Q\ is relatively prime to |Aut(14/^)|. Hence the 
homomorphism a must be trivial. This means that the action of G/W on W is trivial, i.e. that 
Vl^ is a central subgroup of G. Since G/W is isomorphic to Q by 5.4.2, and is therefore cyclic, it 
follows that G is abelian. But since r > by 5.4.2, the prime p divides \W\ and hence divides G. 
As G is a homomorphic image of F it now follows that Hi{T, Zp) ^ 0. Since p G S", this contradicts 
the hypothesis. Thus (5.4.3) is proved. 

To establish the conclusion of the lemma, we first consider the case in which T is non-trivial. In 
this case we have G = G and W = T. We have observed that T is the p-Sylow subgroup of G. By 
5.4.2 and (5.4.3), T is an elementary abelian p-group of rank at least d, normal in G, and G/T = Q. 
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Furthermore, we have observed that Q is cychc, and by 5.4.1 its order is not divisible by 2, 3 or p. 
Thus Alternative (i) of the conclusion holds in this case. 

Now consider the case in which T is trivial. In this case we have G = G/T = Q. Thus G is cyclic, 
and and by 5.4.1 its order is not divisible by 2, 3 or p. In this case we have W = N/N' = Hi{N; Zp). 
Hence the rank r of the elementary abelian p-group W is equal to Kp{N), and in view of (5.4.3) it 
follows that Kp(N) > d. Thus Alternative (i) of the conclusion holds in this case. □ 

Remark 5.5. If Alternative (i) of Lemma 5.4 holds, then it follows from the definitions that 
Up{N) = ap{G) > d. 

Remark 5.6. If Alternative (ii) of Lemma 5.4 holds then Vp{N) < 1. In view of Remark 5.5 it 
follows that Alternatives (i) and (ii) of Lemma 5.4 are mutually exclusive. 

Lemma 5.7. Let T he a finitely generated group which has no abelian subgroup of finite index. 
Suppose that Hi{T, Zg) = for s = 2, 3, 7. Then for every characteristic-2 congruence kernel N in 
r, we have either (i) i'2{N) > 4 or (ii) K2{N) > 4. 

Proof. Suppose that is a characteristic-2 congruence kernel in T. We first consider the case in 
which r is cosolvable. In this case we apply Lemma 5.4 with p = 2 and d = 4. In the notation of 
Lemma 5.4 we have S = {1,2, 3, 7}. By the hypothesis of the present lemma we have Hi(T; Zg) = 
for s = 2, 3, 7; tautologically we have Hi(T; 7L\) = i^i(r; 0) = 0. Hence it follows from Lemma 5.4 
and Remark 5.5 that either t'2(-/V) > 4 or k,2{N) > 4. 

Now suppose that A'" is not cosolvable. Then for some finite field k of characteristic 2, the group 
G = T/N is isomorphic to a subgroup of SL2(A:) which is not potentially triangularizable. According 
to Lemma 3.3, G is isomorphic either to SL2(F2r) for some r > 1, or to the alternating group ^5. 
Since A5 is isomorphic to SL2(F4), the group G is in fact isomorphic to SL2(F2'-) for some r > 1. 
The 2-Sylow subgroup of G is an elementary abelian 2-group of rank r. In particular we have 
ly^iN) = a2{G) = r. If r = 1 then Hi{G;Z2) + 0, which is impossible since i?i(r;Z2) = by 
hypothesis. If r > 4 then alternative (i) of the present lemma holds. Hence we may assume that r 
is equal to 2 or 3. It follows that G is simple, and that the order (2^)'^ — (2*") of G is equal to 60 or 
504. 

We set V = Hi{N; Z2) and d = dim^^ V. It follows from Lemma 4.6 that d > 2. If d > 4 then by 
definition we have K2(A^) > 4, which is alternative (ii) of the lemma. I will now assume that d is 
equal to 2 or 3, and obtain a contradiction. 

We regard V as an elementary 2-group of rank d. We let C denote the kernel of the natural 
homomorphism from N to V. Then £ is a characteristic subgroup of A <l L and is therefore normal 
in r. We set R = T/C. There is a canonical isomorphic identification oiV = N/C with a subgroup 
of R, and we have R/V = {T/C) /{N/C) '^T/C^G. 

Since N is normal in L, the subgroup V is normal in R. Since V is abelian, the action of i? on V 
by conjugation induces an action of R/V = G on V. The action is described by a homomorphism 
a : R/V Aut(y). Since R/V = G is simple, a is either injective or trivial. We have Aut(y) = 
GLd(F2). Since d is equal to 2 or 3, the order of Aut(y) is either (2^-1) (2^-2) = 6 or {2^-l){2^- 
2) (2^ — 2^) = 168. Hence |G| does not divide the order of Aut(y), and a cannot be injective. 

Hence a is the trivial homomorphism. This means that V is central in R, so that i? is a central 
extension of R/V ^ G. In the notation of [15] we have G ^ SL2(F2r) = Ai{2'-)) and r G {2,3}. It 
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therefore follows from [15, Theorems 6.1.2 and 6.1.4] that the Schm' multiplier H'^{G\ C*) IS isomor- 
phic to Z2 . Hence every perfect central extension of G has order at most 2|(5| ([5, (33.8)]). Since 
\R\ = 2'^\G\ > 4|G|, it follows that R is not perfect. Thus there exists a surjective homomorphism 
f3 : R ^ A for some non-trivial abelian group A. 

If l3\V is the trivial homomomorphism then (3 factors through a homomorphism of R/V = G onto 
A. This is impossible since G is simple and non-abelian. If 13\V is non-trivial, then since \V\ 
is a power of 2, the order of A is even. This implies that R admits a homomorphism onto Z2, 
which is impossible since i? is a quotient of T and Hi(T]'L2) = 0. Thus we have the required 
contradiction. □ 

Lemma 5.8. Let T he a subgroup ofSJ-i2{OE) for some number field E. Suppose that i?i(r; Z<j) = 
for s = 2,3 and 7, and that T has no abelian subgroup of finite index. Let z be an element ofT, 
and suppose that trace z is swell in Oe- Then T has a finite-index normal subgroup N such that 
either 

(i) V2{N) > 4 and £ N, or 
(a) K2{N) > 4 and z^ G N . 

Proof. Set r = trace z. The hypothesis that r is swell means that r and — 2 are non-units in Oe. 
Hence there are prime ideals pi and p2 in Oe such that r G pi and — 2 G p2- For i = 1,2, we set 
= Pi = char A;^, hi = /i(p, and Ni = ker/ij. By definition Ni is a characteristic-pi congruence 
kernel. We set gi = hi{z) and ti = trace gi. We set rm = mMi{z), and note that rrii is the order of 
gi in SL2(A;j). Note also that 

h = m{r) = 

and that 

tl = V2{t^) = 2. 

First suppose that for some j G {1,2} we have pj = 2. If j = 1 we have tj = 0, and if j = 2 we 
have t| = 2 = 0. Hence in any case we have tj = 0. By assertion (1) of Lemma 3.1 it follows that 
nij < 2. Hence z'^ G Nj (and in particular z^ G Nj). On the other hand, by Lemma 5.7, we have 
either V2{N) > 4 or K2{N) > 4. Thus one of the alternatives (i) or (ii) of the present lemma holds 
in this case, with N = Nj. 

Now suppose that pi and p2 are both strictly greater than 2, so that A'^i and N2 are odd- 
characteristic congruence kernels. In this case it follows from assertions (1) and (2) of Lemma 
3.1 that mi = 4 and m2 = 8. Hence for i = 1,2 we have nii > 1 and nii ^ pi (since rrii is 
composite). Note also that since ifi(r,Z2) = 0, we have i^i(r,ZmJ = for i = 1,2. Since we 
also have HiiJ'; Z^) = for s = 2 and 3, it now follows from Lemma 4.8 that neither A'^i nor A^2 is 
cosolvable. Since in addition we have mi 7^ m2, the hypotheses of Lemma 5.3 hold. In the notation 
of that lemma we have 61 =2 and 62 = 4. Hence Lemma 5.3 gives a finite-index normal subgroup 
of r such that i'2{N) > 4 and miy{u) = 4. In particular z'* G A^. Hence alternative (i) of the 
present lemma holds in this case. □ 

Lemma 5.9. Let T be a subgroup ofSh2{OE) for some number field E. Suppose that ILi(T; Z^) = 
for s = 2,3 and 7, and that T has no abelian subgroup of finite index. Let z be an element ofT. 
Suppose that trace 2; is nifty but not swell in Oe- Then for some £ G {2,3}, there is a finite-index 
normal subgroup NofV such that max{u£{N) , K£{N)) > 4 and z^ G A^. 
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Proof. Set T = trace z. The hypothesis that r is nifty but not swell implies that r — 1 and r + 1 
are non-units in Oe- Hence there are prime ideals pi and p2 in Oe such that r — 1 G pi and 
r + 1 G p2. For i = 1, 2, we set ki = fcsp^, pi = char /cj, hi = /isp. and Ni = kev hi. By definition Ni 
is a characteristic-pi congruence kernel. We set gi = hi{z) and ti = trace ^j. We set rrii = niNiiz), 
and note that nii is the order of gi in Sh2{ki). Note also that 

= f?i(T) = 1 

and that 

t2 = m{T) = -1. 

First consider the case that for some j € {1,2} we have pj = 2. Then by Lemma 5.7 we have either 
T^2{Nj) > 4 or K2{Nj) > 4. By assertions (3) and (4) of Lemma 3.1, rrij is equal to 1 or 3, and hence 
G Nj. Thus the conclusion of the present lemma holds in this case, with N = Nj and £ = 2. 

We now turn to the case in which pi and p2 are both strictly greater than 2, so that A^i and 
N2 are odd-characteristic congruence kernels. According to Assertion (4) of Lemma 3.1, we have 
mi £ {2,6}. According to Assertion (3) of Lemma 3.1, we have 7712 S {1,3}. 

Since by hypothesis we have Hi(T, Z2) = ^^^i(r, Z3) = 0, it now follows for i — 1,2 that -f/^i(r, ^.^^) — 
0. Furthermore, we have mi > 1; and since pi is an odd prime, we have mi ^ pi. It therefore 
follows from Lemma 4.8 that: 

5.9.1. A'^i is not cosolvahle. 

We distinguish two subcases, depending on whether is cosolvable. First consider the subcase 
in which A^2 is not cosolvable. Since we have observed that mi G {2,6} and m2 G {1,3}, we have 
mi 7^ m2. Combining these observations with 5.9.1, we see that all the hypotheses of Lemma 5.3 
hold in this subcase. In the notation of Lemma 5.3 we have 61 £ {1,3} and O2 G {1,3}. Hence 
Lemma 5.3 gives a finite-index normal subgroup of F such that i^2(A) > 4 and rriN^z) G {1,3}. 
This implies the conclusion of the present lemma, with 1 = 2. 

The rest of the proof is devoted to the subcase in which N2 is cosolvable. In this subcase we claim: 
(5.9.2) p2 = 3. 

To prove this, assume that p2 ^ 3. Since we also have p2 7^ 2, it follows from Lemma 3.1 that 
m2 = 3, so that m2 > 1 and m2 7^ P2- Hence Lemma 4.8 implies that A^2 is not cosolvable, a 
contradiction. This proves 5.9.2. 

We now apply Lemma 5.4 with p = p2 = 3 and d = 3, and with N2 playing the role of N. In the 
notation of Lemma 5.4, we have S = {2,3,8}. By the hypothesis of the present lemma, Hi(T;Zs) 
is trivial for s = 2 and for s = 3. Since Hi(T;7j2) is trivial, so is Hi(T;Zs)- Hence if we set 
G2 = T/N2, Lemma 5.4 implies: 

5.9.3. Either (i) the 3-Sylow subgroup T of G2 is an elementary ahelian 3-group of rank at least 
3, normal in G2, and G2/T is a cyclic group whose order is not divisible by 2 or 3; or (ii) G2 is a 
cyclic group whose order is not divisible by 2 or 3, and K3{N2) > 3. 

By Remark 5.6, Alternatives (i) and (ii) of 5.9.3 are mutually exclusive. 

We now define a subgroup of F as follows. If Alternative (i) of 5.9.3 holds, we set = A^2- 
If Alternative (ii) of 5.9.3 holds, we define N2 to be the mod-3 commutator subgroup of A'^2- In 
either case, we set G2 = F/iV|, and let T* denote the 3-Sylow subgroup of G2. We claim: 
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5.9.4. T* is an elementary abelian 3-group of rank at least 3, normal as a subgroup of G2, and 
G2/T* is a cyclic group of odd order. 

If Alternative (i) of 5.9.3 holds, 5.9.4 is immediate. If Alternative (ii) of 5.9.3 holds, then K = 
N2/N2 is a normal subgroup of G2 = T/N2, and K is an elementary abelian 3-group since N2 is 
the mod-3 commutator subgroup of N2. Since G2/ K = T /N2 = G2 is a cyclic group whose order 
is not divisible by 2 or 3, we have K = T*. By the definition of k,3(N2), the rank of T* = K is 
^3(-^2) ^ 3. This completes the proof of 5.9.4. 

Next, we claim that 

(5.9.5) m^|(z) G {1,3}. 

If Alternative (i) of 5.9.3 holds, we have mj\f* = m2- As we have observed that 7712 G {1)3}, (5.9.5) 
holds in this case. In the case where Alternative (ii) of 5.9.3 holds, m2 = m^^^z) G {1,3} is the 
order of ZN2 in T/N2, which is a cyclic group whose order is not divisible by 3. Hence m2 = 1, i.e. 
z G It follows that N2Z belongs to the elementary abelian 3-group N2/N2, and therefore has 
order 1 or 3. Thus (5.9.5) is proved in all cases. 

It follows from 5.9.4 that G2 is solvable and has odd order; thus N2 <l T is cosolvable and has odd 
index. According to 5.9.1, the odd-characteristic congruence kernel A'^i is not cosolvable. By 5.9.5 
we have m]y*{z) G {1,3}, and we observed above that uini = rni G {2,6}. In particular we have 
mj^*{z) 7^ m7Vi(-z)- Thus the hypotheses of Lemma 5.2 hold with N2 playing the role of N2 in 
the statement of that lemma. In the notation of Lemma 5.2, we have 9i = mi\;^{z) G {1,3} and 
62 = mN*{z) G {1,3}. Lemma 5.2 therefore gives a finite-index normal subgroup of L such that 
mM{z) G {1,3} and such that I'l^N) = vii{Ni) + i/^(A^|) for every prime I. Taking £ = 3, we find 
from 5.9.4 that 

= ^?AG*2) = dimz3(T*) > 3. 
By Assertion (7) of Lemma 3.6, the group Gi = T /Ni has order divisible by 3, and hence V2{Ni) = 
c"3(Gi) > 1. Thus we have 

iy^{N) = u,XNi) + iy^{N^) > 4. 

On the other hand, since mN{z) G {1,3}, we have z^ G A^. Thus the conclusion of the lemma holds 
in this case, with i = 3. □ 

6. Independence criteria in 3-manifold groups 

The main results of this section are Propositions 6.3 and 6.4, which provide the transition between 
the results of Section 5 and our geometric theorems. 

Lemma 6.1. Suppose thatTo is a torsion-free cocompact discrete subgroup o/Isom_|-(]H^), and that 
X and 7 are non-commuting elements o/Lq. Then no non-trivial power of ^x^~^ commutes with 
any non-trivial power of x. 

Proof. Since Lq is discrete, torsion-free and cocompact it is purely loxodromic. Hence Lq is torsion- 
free and the centralizer of every non-trivial element of Lq is cyclic. In particular any two commuting 
elements of T have a common non-trivial power. 

If xq denotes a generator of the centralizer of x, then xq is clearly a primitive element of T having 
a; as a power. 
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Suppose that 7x™'7~ commutes with x" for some m,n ^ 0. If 2; 7^ 1 is a common power of 
^^m^-i ^j^g centrahzer C of z contains both 7x07"^ and xq. Since 7x97"^ and xq are 

primitive and C is cychc, each of the elements 7x97"^ and xq is a generator of C, and hence 
7x07""^ = x^"^. In particular 7^ commutes with xq. Thus the centrahzer of 7^, which is also cyclic, 
contains both 7 and xq; since xq is primitive this means that 7 G (xq), and hence 7 commutes with 
X, a contradiction. □ 

Lemma 6.2. LetT be a discrete, cocompact subgroup o/Isom+(]H[^). LetT be a subgroup ofT such 
that dimxj iJi(T;Z^) > 4 for some prime I. Suppose that u and v are non-commuting elements of 
r, and that m and n are positive integers such that and vu^v~^ lie in T . Then and vu^v~^ 
are independent elements (2.1) ofT. 

Proof. It follows from Lemma 6.1 that and vu^v^^ do not commute. 
We claim: 

6.2.1. The group F = {u"^ ,vu^v~^) is not cocompact. 

To prove this, note that since F is discrete, torsion- free and cocompact we have T = 7ri(M) for 
some closed, orientable hyperbolic 3-manifold M. The hypothesis then implies that Hi{M; Z^) > 4 
for some prime £. It then follows from [25, Proposition 1.1] that any two-generator subgroup of 
7ri(M) has infinite index in tti(M). Hence F has infinite index in T and is therefore not cocompact. 
This proves 6.2.1. 

It follows from [4, Theorem 7.1] that any two-generator non-cocompact purely loxodromic subgroup 
of Isom-|-(]H^) is free. Hence by 6.2.1, F is free. But since and vu^v^^ do not commute, F is 
non-abelian. Hence it is free of rank 2, and so u"^ and vu'^v~^ are independent. □ 

Proposition 6.3. Let T < SL2(C) be a cocompact torsion-free lattice having integral traces, such 
that Hi(T;7js) = for s = 2,3 and 7. Let K denote the trace field of T. Let u and v be non- 
commuting elements of T. Suppose that u is a power of an element of T whose trace is swell in 
Ok- Then either 

(1) u and vu'^v~^ are independent elements ofT, or 

(2) V? and vv?v~'^ are independent elements ofT. 

Proof. Let us write u = , where A; is a positive integer and z is an element of T such that trace z 
is swell in Ok- Then tracer is swell in Oe by 2.7. Hence Lemma 5.8 gives a finite-index normal 
subgroup iV of r such that either (i) V2{N) > 4 and £ N, or (ii) K2{N) > 4 and z"^ G N. Set 
G = N/T, let vr : r — > G denote the quotient homomorphism, and set g = 7r(n). 

First suppose that (i) holds. Let T denote the 2-Sylow subgroup of N. Since = {z^)'' e N, we 
have = e; hence we may suppose T to be chosen within its conjugacy class so that g £ T. We 
have dim(ii'i(T);Z2) = ct2(G) = z^2(A^) > 4. 

Now set T = TT~^{T). Then tt\T : T ^ T is surjective, and hence dim(i/i(T); Z2) > 4. Since 
Tr{u) = g £ T, we have u £ T. Since N is normal and contains u^, we have vu'^v'^ £ N < T. 
It now follows from the case m = 1, n = 4 of Lemma 6.2, with T = T, that u and vu'^v~^ are 
independent in T. 

Now suppose that (ii) holds. We have dim(ifi(A^); Z2) = K2{N) > 4. Since N is normal and 
contains u"^ = (z^)'', it also contains vu'^v~^. We now apply the case m = n = 2 of Lemma 6.2, 
with T = iV, to deduce that and vv?v~^ are independent in T. □ 
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Proposition 6.4. Let T < SL2(C) be a cocompact torsion-free lattice having integral traces, such 
that Hi{r;7j2) and Hi{T;Z3) are trivial. Let K denote the trace field of T . Let u and v he non- 
commuting elements of T, and suppose that u is a power of element of T whose trace is nifty but 
not swell in Ok- Then and vu^v~^ are independent elements ofV. 

Proof. Let us write u = , where A; is a positive integer and z is an element of T such that trace z 
is nifty but not sweh Ok- It then fohows from 2.7 that tracer is nifty but not swell in Oe- Hence 
Lemma 5.9 gives a finite-index normal subgroup of F and a prime i G {2, 3} such that z^ ^ N 
and max(iyi(N) , K£(N)) > 4. Thus either v^^N) > 4 or /t^ {N)) > 4. Then N contains = {z^)^ . 
Since N is normal, it also contains vu^v"^. 

Set G = N/T, let vr : r — > G denote the quotient homomorphism, and set g = 7r(u). 

Let T denote the ^Sylow subgroup of A^. We have dim(i?i(T); Za) = a2{G) = V2{N) > 4. 

Set T = 7r~^(T). Then 7r|T : T ^ T is surjective, and hence 

(6.4.1) dim(i7i(f ;Z2)) > dim(i/i(r); Z2) = (j{G) = ue{N) > 4. 
On the other hand, we have 

(6.4.2) dim(i?i(iV; Z2)) = ue{N) > 4. 

We define a finite-index normal subgroup T of T as follows. If Vi{N) > 4 we set T = T. If 
ff(iV) < 4, in which case ki{N)) > 4, we set T = T. It then follows from (6.4.1) and (6.4.2) that 
in either case we have dim(ffi(T; Z2)) > 4. By definition we also have > T in either case, and 
hence and vu^v~^ belong to T. It now follows from the case m = n = 3 of Lemma 6.2 that 
and vu^v~^ are independent in T. □ 

7. Distances on a sphere 

Proposition 7.1. If Qi, . . . , Qn are points on S"^ , we have 

^ cos ds{Qi, Qj) > -n/2. 

l<i<j<n 

Proof. We regard as the unit sphere in E^, and we let Vi G denote the position vector of Qi. 
We have 



n n 

Vi 

i=l i=l 

n 



i=l i^j 

= n + 2 E {'"i,Vj) 

l<i<j<:n 

= 2(2+ E cos ds{Qi,Qj)), 

l<i<j<n 

from which the conclusion follows. □ 

Corollary 7.2. If I is a four-element index set and {Pi)i^i is an indexed family of points on S"^, 
there exist three distinct indices p, q,q' £ I such that 

cos{ds{Pp,Pg))+cos{d,{Pp,Pq,)) > -2/3. 
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Proof. We may assume that / = {1,2,3,4}. Let T denote the set of all ordered pairs of the form 
{p,S), where p is an index in {1,2,3,4} and 5 is a two-element subset of {1,2,3,4} that does not 
contain p. We have |T| = 12. For each element {p,S) of T, let us set a(p,5) = cos{ds{Pp, Pg)) + 
cos{ds{Pp, Pq')), where q and q' are the elements of S. For each pair (i, j) with 1 < i < j < 4, there 
are exactly four elements {p,S) of T such that one of the indices i,j is equal to p and the other 
belongs to S. Hence 

(7.2.1) «{P,5)=4 cos{ds{PuPj))>-8, 

{p,s)er i<i<j<4, 

where the final inequality follows from Proposition 7.1. Since there are 12 terms on the left-hand 
side of (7.2.1), we must have a(p,5) > —2/3 for some {p, S) £ T. □ 

8. Nifty traces and displacements 

In this section we give the proof of Theorem 1.6, after a good many technical preliminaries. 
We set 

V = {{T, fi,h) :0 <T <2, <h< fi} CK^. 
We define a real-valued function $ with domain T> by 

^{T, /i, h) = max ^2 arccosh((l + T /2) smb? fi + cosh h), arccosh(cosh /i cosh(/z — h)) +2fi^. 

Lemma 8.1. Let {T,fj,, h) £T> he given. Let AiBiCi and A2B2C2 be hyperbolic triangles such that 
d{Ai, Ci) < fi and d{Bi, Ci) < n for i = 1,2. Suppose that 

(8.1.1) cos{Z{AuCi, Bi)) + cos{Z{A2,C2, B2)) > -T. 
Then 

d{Ai,Bi) + d{A2,B2) < ^{T,fi,h). 

Proof For i = 1,2 we set a = d{Bi,Ai), hi = d{Ci,Ai), m = d{Ci,Bi), and ji = Z{Bi,Ci,Ai). By 
hypothesis we have 

(8.1.2) ai < n, hi < IX 
for i = 1, 2, and 

(8.1.3) cos 71 -|- cos 72 > —T. 
The hyperbolic law of cosines gives 

(8.1.4) cosh Ci = cosh hi cosh Oj — sinh hi sinh Oj cos 7^. 
We claim that for each i £ {i, 2} we have 

(8.1.5) cosh Ci < max((l — cos 7^) sinh^ fi + cosh h, cosh /i cosh(/i — h)). 

To prove 8.1.5 for a give i E {1,2}, we first consider the special case in which 7j > 7r/2. In this 
case we have cos7i < 0; hence (8.1.4), with (8.1.2), gives 

cosh Ci < cosh^ /i — sinh^ fi cos 7^ 

= (1 — cos 7j) sinh^ fx + 1 

< (1 — cos 7j) sinh^ -|- cosh h, 
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SO that (8.1.5) holds. Next consider the special case in which min(aj,6i) > ji — h. In this case, in 
view of (8.1.2), both Oj and 6, lie in the interval [/i — /i, /i]. Hence \ai — bi\ < h. Using (8.1.4), we 
find that 

cosh Ci = cosh(aj — 6j) + sinh Oj sinh 6j(l — cos 7^) 
< cosh h + (sinh^ /i)(l — cos 7^), 

so that again (8.1.5) holds. There remains only the case in which 7^ < tt/2 and min(aj, 6j) < fi — h. 
Then cos7i > 0, and by symmetry we may assume that ai < /j, — h. By (8.1.2 we have bi < /i. 
Using (8.1.4) we deduce that 

cosh a = cosh hi cosh m — sinh hi sinh Oi cos 7^ 
< cosh fi cosh(/i — h) , 

which again implies (8.1.5). Thus (8.1.5) is proved in all cases. 
Since (8.1.5) holds for each i £ {1,2} we have either 

(8.1.6) cosh Cj < (1 — cos 7j) sinh^ // + cosh for each i € {0, 1}, 
or 

(8.1.7) cosh Cj < cosh /i cosh (/X — /i)) for some i G {0, 1}. 

If (8.1.6) holds then, using (8.1.3) we find that 

cosh ci + cosh C2 < (1 — cos 71) sinh^ fi + cosh h + {1 — cos 72) sinh^ jj, + cosh h 
< (2 - r) sinh^ fi + 2 cosh h. 

Since cosh is a convex function, we have 

cosh ^ ^ ^ ^ ^ < -(coshci +coshc2) 

< ^ ((2 - T) sinh^ fi + 2 cosh h) , 

so that 

ci + C2 < 2 arccosh((l - T/2) sinh^ fi + cosh h) < <!>{T, /i, h). 
This is the conclusion of the lemma in this case. 

It remains to prove that if (8.1.7) holds then we still have ci + C2 < ^{T, fj,,h). By symmetry we 
may assume that coshci < cosh /u cosh (/i — h)), so that 

ci < arccosh(cosh/i cosh(^ — h)). 

By the triangle inequality and (8.1.2) we have 

C2 < 02 + 62 < 2fl. 

Hence 

ci + C2 < arccosh(cosh // cosh(/i — h)) + 2/i < ^{T,ij,, h). 

□ 
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Lemma 8.2. Let {T,fi,h) £ V and n > 4 be given. Let P be a point ofM^, and let (,i,...,S,n 
be isometries of such that d{P,(^i ■ P) < for i = 1, . . . ,n. Suppose that there exist indices 
k, I G {1, . . . ,n — 1}, with I > k + 2, such that 

(8.2.1) cos(Z(4T^ • P, P, 6+1 • P)) + cos(Z(^r' " 6+i • P)) > -T. 
Set (i = ^1^2 ■ ■ ■S.n- Then 

d{P, P-P)< $(r, fi, h) + {n- 

Proof. For i = 1, n set Pi = • P and Di = d{P, Pi), so that Di < fi. Set fij = £,i ■ £,2 - ■ ■ Cj 
for < j < n, so that = 1 and /3„ = (3. Set Qj = Pj • P, and note that Qj = I3j-i ■ Pj. For 
< m < m' < n, the triangle inequality gives 

m 

d{Qm,Qm')< d{Qj^i,Qj) 
j=m+l 
m 

= d{f3,.,.P,p,.,.P,) 

j=m+l 
m 

= E 

j=m+l 

and hence 

(8.2.2) d{Qm, Qrn') < {m' - m)fi. 

Note that (8.2.2) is trivial when m = m! , and is therefore true whenever < m < m' < n. Hence 
(8.2.3) 

d{P, p-P)< d{Qo, Qk^i) + d{Qk-i,Qk+i) + d{Qk+i,Qi^i) + d(Qi_i, Q/+i) + d{Qi+i,Qn) 
< (n - 4)/i + d{Qk^i,Qk+i) + d{Qi^i,Qi+i). 

Next we note that if i is equal to either k or I, we have 

(8.2.4) d{Qi^i, Q^+l) = d(A_i • P,Pi,lU^+l ■ P) = d(er' • ^,^^+1 • P)- 

We have d{£^^ ■ P, P) = Di < fx, and (i(^i+i ■ P,P) = -Dj+i < fi foi i = k, I. By hypothesis we have 

cos(z(efc-i • p, p, Cfc+i • p)) + cos(z(er' • p, p, ii+i ■ p)) > -t. 

Hence we may apply Lemma 8.1, with Ai = A2 = P, Bi = ■ P, B2 = £,f^ ■ P, Ci = £k+i " P 
and, C2 = Ci+i • P, to conclude that 

(8.2.5) d(47^ • P, 6+1 • P) + d{Cr' ■ P, 6+1 • P) < ^(T, fx, h). 
From (8.2.3), (8.2.4) and (8.2.5) it follows that 

d{P, P-P)<{n-A)fi + d{Qk^i,Qk+i) + d{Qi^i,Qi+i) 

= {n- 4)^ + d{^,' ■ P, (k+i ■ P) + d{^r' ■ P, Ci+i ■P)<{n- 4)fi + ^T, fi, h), 
which is the conclusion of the lemma. □ 



We will also need the following variant of Lemma 8.2. 
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Lemma 8.3. Let T £ [—1,1] o,nd fi > be given. Let P he a point o/ H^, and let (,i,---,S,n 
be isometries of such that d{P,^i • P) < fJ- for i = 1, . . . ,n. Suppose that there is an index 
A; G {1, . . . , n - 1} such that cos(Z(^^^ • P, P, ^^+1 • P)) > -T . Set (3 = 6^2 • • • Cn- Then 

d{P, P ■ P) < arccosh(cosh^ ji + Tsinh^ /i) + (n — 2)fi. 

Proof. We set Pj = • ^2 • • • S,j for < j < n, so that Po = I and = (3- By the triangle inequality 
we have 

fc-2 n-l 

(8.3.1) d{P,p- P) < {J2 d{Pi ■ P, A+i • P)) + d{(3k-i ■ P,Pk+i ■P) + {T. ■ P,Pi+i ■ P)), 

i=0 i=k+l 

where the sums J2iZo d{Pi ■ P, A+i • P)) and J2^=k+i d{Pi ■ P, Pi+i ■ P) are interpreted as being in 
the respective cases k = 1 and k = n — 1. For i = 0, . . . , n — 1 we have 

(8.3.2) d{pi ■ P, Pi+i ■P)) = d(A • P, Piij+i • P)) < /i. 
Furthermore, 

d{(3k-l ■ P, /3k+l • P) = d{(3k-l ■ P, ^3k~lC^C^+l ■ P) 

(8.3.3) =d{P,CiCi+i-P) 

= d{Ci' ■P,^i+i-P). 

Applying the hyperbolic law of cosines to the triangle with vertices ^^^^ ■ P, P and • P, we find 
that coshd{^~^ ■ P, ^j+i • P) is equal to 

cosh d{^;^ ■ P, P) cosh d{P, ■ P) - sinh d(eri . P, P) sinh d{P, ^,+1 • P) cos(Z(er^ • P, P, Ci+i ■ P)), 
and hence that 

(8.3.4) cosh d(^-^ • P, P) < cosh^ fi + sinh^ ^ + Tsinh^ jj. 

The conclusion of the lemma follows immediately from (8.3.1), (8.3.2), (8.3.3) and (8.3.4). □ 
8.4. We now turn to the proof of Theorem 1.6, which was stated in the introduction. 

Proof of Theorem 1.6. We consider the unit sphere S in the tangent space to H^. We let / denote 
the set {x, x~^, y, y~^} C F. For each t G / let rt denote the ray from P to t ■ P, and let Qt £ S 
denote the unit tangent vector to the ray Vf. Then for any two distinct indices s,t £ I, we have 

(8.4.1) ds{Qs,Qt) = As-P,P,t-P). 

We claim: 

8.4.2. There exist non- commuting elements u and v of I that at least one of the following alterna- 
tives holds: 

(i) cos{ds{Qy-i,Qu)) +cos(ds(Qt,-i,Qu-i) > -2/3, or 

(a) cos(4(Qi,-i,(5n)) +cos(4(Qu-i,Qu)) > -2/3. 

To prove this, first note that according to Corollary 7.2, there exist three distinct indices ^, r],r]' £ I 
such that 

cos{ds{Q^,Qij)) + cos{ds{Q^,Qn')) > -2/3. 
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Because ^, rj, rj' are distinct elements of /, two of them must be inverses of each other. By symmetry 
we may assume that either r]' = rj~^ ov r]' = . li r]' = r]~^ , let us set u = r] and v = Then 
we have 

cos{ds{Qy-i,Qu)) + cos{ds{Q^-i,Qu~i)) > -2/3, 
which gives Alternative (i) of 8.4.2. If tj' = let us set u = and v = rj^^. Then we have 

cos(4(Q«,Qi,-i)) + cos(4((3«,(5m-i)) > -2/3, 
which gives Alternative (ii). Thus 8.4.2 is established. 

We now fix non-commuting elements u and v of the set {x, x^^, y, y^^} C T such that one of the 
alternatives (i), (ii) of 8.4.2 holds. Note that trace n G {trace x, trace y}. In particular, 

8.4.3. The element u is a power of some element zofT whose trace is nifty. 

We now set fj, = 0.3925 if trace z is swell, and fi = 0.3 otherwise. We are required to prove that 
max(d{P,x ■ P),d{P,y ■ P)) > fi. For the rest of the argument I will assume that max(d{P,x ■ 
P), d{P, y ■ P)) < /U, and eventually this will produce a contradiction. 

Since u and v are non-commuting elements of {x, x~^, y, we have 

(8.4.4) max{d{P,u ■ P),d{P,v ■ P)) = max{d{P, x ■ P) , d{P, y ■ P)) < ^i. 

We claim: 

8.4.5. For every integer m >3 and every real number h with < h < fi, we have 

d{P, vu^'v"^ ■ P) < ^>(2/3, /i, h) + {m- 2)/x. 

The proof of 8.4.5 is an application of Lemma 8.2. Given any integer m > 3, let us set n = m + 2, 
= V, (,i = u for i = 2, . . . , m -|- 1 and = It follows from (8.4.4) that d{P, ■ P) < IJ- 

for i = 1, . . .m + 2. According to Lemma 8.2, the inequality asserted in 8.4.5 will hold if there are 
indices k,l £ {1, . . . ,m + 1}, with l>k + 2, such that (8.2.1) holds with T = 2/3. 

If Alternative (i) of 8.4.2 holds, then by 8.4.1 we have 

-2/3 < cos(Z(?;"^ ■ P, P,u ■ P)) + cos{Z{u-^ ■ P, P,v-^ ■ P)) 

= cos(Z(^ri • P, P, 6 • P)) + cos(Z(C„i^.i • P, P, U+2 -P))- 

Thus (8.2.1) holds with T = 2/3, k = \ and I = m + 1. We have — / = m > 3 in this case. 

If Alternative (ii) of 8.4.2 holds, then by 8.4.1 we have 

-2/3 < cos(Z(t;^^ ■P,P,u- P)) + cos(Z('u"i ■P,P,u- P)) 

= cos(Z(er' • P, P, 6 • P)) + cos(Z(e-i • P, P, U+i ■ P)). 

Thus (8.2.1) holds with T = 2/3, k = 1 and / = m. We have A: — / = m — l>2in this case. This 
completes the proof of 8.4.5. 

There is a counterpart of 8.4.5 for the case m = 2. We claim that 
(8.4.6) d{P, vu^v^^ ■ P) < arccosh(cosh^ ^ ~^ 2 ^^^^^ ~^ 

To prove 8.4.6, we first note that by Proposition 7.1 we have 

cos{ds{Qu,Qu~i)) + cos{ds{Qu,Qv-^)) + cos{dsiQu-i,Qv~i)) > -3/2. 
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Hence at least one of the terms cos{ds{Qu,Qu-^))i cos{ds{Qu,Qv-^)) cos(ds((5„-i, Q^-i)) is 
bounded below by —1/2. If we set ^1=^,^2 = ^3 = '^ and ^4 = v~^, this means that 
cos{ds{Q^^,Q^-i )) > —1/2 for some k £ {1,2,3}. In view of (8.4.1), this in turn means that 

cos(z(efc-p,p,e,7;i-p))>-i/2 

for some k G {1,2,3}. Thus the hypothesis of Lemma 8.3 holds with n = 4 and T = 1/2. The 
inequality (8.4.6) is now simply the conclusion of Lemma 8.3. 

The rest of the proof is divided into cases depending on whether trace z is swell or not. We first 
consider the case in which tracer is swell. In this case we have fi = 0.3925 by definition. Since 
trace z is swell, u is a power of z, and u and v do not commute, it follows from Proposition 6.3 that 
either u and vu'^v~^ are independent elements of T, or and vu'^v~^ are independent elements of 

r. 

Consider the subcase in which u and vu'^v~^ are independent. By (8.4.4) we have d{P, u - P) < fi = 
0.3925. Applying 8.4.5, with m = 4 and h = 0.07, we obtain 

d(P, vu'^v-^ ■ P) < ^>(2/3, 0.3925, 0.07) + 0.3925 x 2 = 2.084 .... 

Applying Theorem 2.2 with X = u and Y = vu'^v~^, we find 

1/2 > + exp{d{P,u ■ P)) + 1/(1 + exp{d{P,vu\-^ ■ P)) 

> 1/(1 +exp(0.3925)) + 1/(1 + exp(2.09)) 
= 0.513..., 

a contradiction. 

Now consider the subcase in which u'^ and vu'^v^^ are independent. From (8.4.4) it follows that 
d{P,v?' • P) < 2/x = 0.785, and according to 8.4.6 we have 

d{P, vu^v^^ ■ P) < arccosh(cosh^ ^ ~^ 2 ^^^^^ ~^ '^^ 

= arccosh(cosh2 (0.3925) + ^ sinh2(0.3925)) + 0.785 
= 1.469007.... 

Applying Theorem 2.2 with X = u'^ and Y = vu^v~^, we find 

1/2 > 1/(1 +exp{d{P,u'^ ■ P)) + l/{l + exp{d{P,vu'^v-^ ■ P)) 

> 1/(1 +e2x0.3925))^ 1/(1 ^gl.46901)^ 

= 0.5003..., 

and again we have a contradiction. 

We now turn to the case in which trace z is not swell. In this case we have fi = 0.3 by definition. 
By 8.4.3, is a power of z and trace z is nifty . Since trace z is not swell, and since u and v do not 
commute, it follows from Proposition 6.4 that and vu'^v~^ are independent elements of T. 

From (8.4.4) it follows that d{P, ■ P) < 3fi = 0.9. Applying 8.4.5 with m = 3 and /i = 0.06, we 
obtain 

d{P, vu^v"^ ■ P) < ^>(2/3, 0.3, 0.06) + 0.3 = 1.29 .... 
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Applying Theorem 2.2 with X = v? and Y = vv?v^^^ we find 

1/2 > 1/(1 +exp(d(P,M^ • P)) + 1/(1 + exp((i(P,wti^t;-^ • P)) 
< 1/(1 + e°-9)) + 1/(1 + ei-3)) 
= 0.503..., 

and we have a contradiction in this case as welL □ 

Corollary 8.5. Let T < SL2(C) be a non- elementary torsion-free discrete group having integral 
traces, and suppose that Hi{T;7jp) = for p = 2,3 and 7. Let K denote the trace field of T . 
Suppose that 

(8.5.1) trace 7 S Ok is nifty for every 7 E F — {1} with length(n(7)) < 0.3. 
Then 0.3 is a Margulis number for n(r). 

Furthermore, if we assume that 

(8.5.2) trace 7 G Ok is swell for every 7 G T — {1} with length(n(7)) < 0.3925, 
then 0.3925 is a Margulis number for n(r). 

Proof. If (8.5.2) holds, we set /i = 0.3925. If (8.5.1) holds but (8.5.2) does not, we set 11 = 0.3. If 
X and y are non-commuting elements of T we must show that 

max{d{P, X ■ P),d{P,y P)) > fi. 

This is trivial if either n(x) or n(y) has translation length greater than /i. We therefore assume 
that length(n(x)) and length(n(?/)) are at most fx. If (8.5.2) holds, so that fj, = 0.3925, it follows 
that trace X and trace y are swell, and the second assertion of Theorem 1.6 implies that 

max{d{P,x ■ P),d{P,y P)) > 0.3925 = ^u. 

If (8.5.1) holds (8.5.2) does not, so that fi = 0.3, it follows that trace x and trace y are nifty, and 
the first assertion of Theorem 1.6 implies that 

max{d{P,x ■ P),d{P,y P)) > 0.3 = ^u. 

□ 

9. The case of a quadratic trace field 

The main result of this section is Theorem 9.3, which will easily imply Theorem 1.1 of the intro- 
duction. 

Proposition 9.1. Let ^ be a positive real number, and let A be an element o/SL2(C) such that 
length(n(74)) < Set traced = + irj. Then 

t: \ 2 / N 2 



^^'^'^^ V2cosh(/i/2)j ^Usinh(^/2)j " ^" 

Proof Set l + ie = Clength(n(^)), and set z = exp((/ + ie)/2). By (2.5.2), we have 

1 z 

^ + irj = ±2 cosh(Clength(n(A))/2) = z + - = z + --^ = z + e~h. 

z z 
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Hence if x and y denote the real and imaginary parts of z, we have = x(l + e ') and rj = y{l — e ') 
so that 



2 / N 2 



2 I „.2 I |2 „Z 



+ —r] =x +y =\z\ =e. 



This gives 

2 / \ 2 



+ „ . =1 



,2cosh(Z/2)/ V2sinh(V2), 
from which the conclusion follows. □ 

Lemma 9.2. IfV < SL2(C) be a cocompact discrete group, the trace field of K cannot be a subfield 
o/R. 



Proof. Since T is cocompact, it is non-elementary. If C R, it follows from [6, proof of Theorem 
5.2.1] that r is conjugate to a subgroup of SL2(R); but this contradicts cocompactness. □ 

Theorem 9.3. Let T < SL2(C) be a cocompact discrete group having integral traces. Suppose that 
Hi(T;Zp) = for p = 2,3 and 7, and that the trace field of T is a quadratic number field. Then 
0.3925 is a Margulis number for Il{T) . 



Proof. Let K denote the trace field of T. It follows from Lemma 9.2 that K is an imaginary 
quadratic field. By Corollary 8.5, it suffices to prove that if 7 is a non-trivial element of T with 
length(n(7)) < 0.3925, then r = trace 7 G Or is swell. 

Let ^ and r/ denote the real and imaginary parts of r. Since length(n(7)) < 0.3925, the inequality 
(9.1.1) holds with = 0.3925. In particular we have 

(9.3.1) \r]\ < 2 sinh(0.3925/2) = 0.395 . . . 

Suppose that r is not swell. By definition this means that either r is a unit of Ok or that — 2 
is a unit. 

First suppose that r is a unit. Since K is an imaginary quadratic field, we have r = ±1, r = iti, 
or r = {l±iV3)/2. Since 7 is loxodromic we cannot have r = ±1. If r = ±i, or r = 
then rj > 1/2, and (9.3.1) is contradicted. 

There remains the possibility that — 2 is a unit. In this case we have - 2 = ±1, - 2 = ±i, or 

— 2 = (1 lb iy/S) /2. If — 2 = ±1 then either r = ib-v/3 or r = ibi. Since K is an imaginary field, 
it cannot contain \/3. If r = iti then r is a unit in K, and we have already ruled out this case. If 

- 2 = ±i, so that r = ±^/2±i, or if - 2 = (1 ± iVS)/2, so that r = ±((5 ± f\/3)/2)i/^ then 
|rp is irrational, which is a contradiction since r is an element of an imaginary quadratic field. □ 

9.4. We conclude this section with the 



Proof of Theorem 1.1. Suppose that M satisfies the hypothesis of Theorem 1.1. Let us write 
M = H^/To for some torsion-free discrete subgroup Lq of Isom+(]HI'^). According to 2.4, Lq is 
the isomorphic image under 11 of a cocompact (and torsion-free) subgroup T of SL2(C). The hy- 
potheses of Theorem 1.1 now immediately imply those of Theorem 9.3. The latter theorem therefore 
implies that 0.395 is a Margulis number for P(r) = Tq and hence for M. □ 
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10. The case of a cubic trace field 

The main result of this section is Theorem 10.4, which will easily imply Theorem 1.2 of the intro- 
duction. 

Lemma 10.1. Let K C C be a degree-3 extension o/Q such that K (^H, and let p he an element of 
Ok- Suppose that both p and p—1 are units in Ok- Then either (i) \ Im(p)| > 0.36, or (ii) there is 
a unit £ Ok with minimal polynomial + X'^ + 1, such that either p = or p = = ^^^ + 1. 

Proof. If /9 G Q, then since p and p—1 are units in Ok, we have {p,p — 1} C {1, —1}, which is 
impossible. Hence p ^ Q. Since K has degree 3 over Q it follows that K = Q(p) and hence that 
p ^ R. In particular, if we regard K as an abstract number field, it has a complex place. Since K 
is cubic it cannot have more than one complex place. In particular the number of complex places 
is odd, and hence by [26, Lemma 2.2], K has negative discriminant. 

The hypothesis of the lemma implies that pi = p and P2 = 1 — p are units. Since pi and p2 satisfy 
the unit equation pi + p2 = 1, and lie in the cubic field K of negative discriminant, it follows from 
[20, Theoreme 2] that either 

(1) there is a unit ^ G Ok with minimal polynomial X^ + X'^ — 1 such that the unordered pair 
{pi, P2} is one of the pairs on the following list: 

{e, f}, f}, {-r\c'}, {-c, r'}, r'}, {-r\ c'y, or 

(2) there is a unit r] E Ok with minimal polynomial X^ + X — 1 such that the unordered pair 
{pi, P2} is one of the pairs on the following list: 

{V,v^},{-V'^,r]'^},{-V\v~^}- 

In case (1), since p ^ R, the root ^ of X^+X^ — 1 is imaginary. Hence ^ = —(0.877 . . .)ibi(0.744 . . .). 
Calculating powers of we find that for —3 < m < —1 and for 1 < m < 5 we have | Im(^)| > 0.5, 
whereas Im(^~^) = 0.18 .. .. Note that since ^ is a root of X^ + X'^ — 1, we have = ^"'^ + 1. 
The conclusion of the lemma follows in this case. 

In case (2), since p ^ R, the root rj oi X^ + X - 1 ^ is imaginary. Hence rj = —(0.34116 . . .) + 
i(l. 16154 .. .). Calculating powers of r] we find that for m = —1 and for 1 < m < 3 we have 
I Im(r/)| > 0.7, whereas Im(?7~^) = 0.368 . . .. Note that since r/ is a root of X^ + X — 1, we have 
_ _|_ hence Im(r/~'^) = 0.368 . . .. Thus the conclusion of the lemma holds in this case 

as well. □ 

The following result is analogous to Lemma 10.1. 

Lemma 10.2. Let K be a degree-3 extension of Q with K H, and let p be an element of Ok- 
Suppose that both p^ — 2 and p — 1 are units in Ok- Then either Imp > 0.36, or the minimal 
polynomial of p over Q is X^ + 3X^ — lAX + 11. 

The proof of Lemma 10.1 was based on the results of [20]. As I am not aware of any similar sources 
that can be quoted in proving Lemma 10.2, I have provided a proof from scratch. This proof, which 
involves some rather laborious calculations, will be found in an appendix to this paper. Section 12. 

Lemma 10.3. Let K be a degree-3 extension of <Q with K <^ Tl, and let r be an element of Ok- 
Then one of the following alternatives holds: 
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(i) T is nifty; or 
(a) Im(T) > 0.36; or 

(Hi) one of the elements r, — r, 1 + r or 1 — t of Ok has minimal polynomial X'^ + 2X^ — 3X + 1; 
or 

(iv) one of the elements r or — r of Ok has minimal polynomial X^ + — lAX + 11. 

Proof. Suppose that Alternative (i) does not hold. Then in particular either r — lorr + lisa unit 
in Ok- Hence there is some e S {1, —1} such that er — 1 is a unit. We set p = er, so that p — 1 is 
a unit. 

The assumption that Alternative (i) does not hold also implies that either r is a unit in Ok, or 
— 2 is a unit in Ok- 

We first consider the case in which r is a unit. Then p and p — 1 are units. Lemma 10.1 then 
implies that either |Im(p)| > 0.36, or one of the elements p or p — 1 has the form where ^ 
is a unit in Ok having minimal polynomial X'^ + X'^ — 1. If | Im(p)| > 0.36, then since p = itr. 
Alternative (ii) of the present lemma holds. 

If either p or p—1 has the form where ^'^+^^ — 1 = 0, we have = (1— ■C^)^ and hence ^^+2,^^ = 
+ 1. Squaring both sides of the latter equality and simplifying, we obtain (^^'^ + 3x^ + 2x'^ — 1 = 0. 
Hence — is a root of the polynomial X^ + 2X^ — 3X + 1, which is irreducible by the rational 
root test and is therefore the minimal polynomial of — The latter element is equal to either 
ibr or 1 ± T, and hence Alternative (iii) of the present lemma holds. 

We now turn to the case in which — 2 is a unit in Ok- In this case /o^ — 1 and p — 1 are units. 
By Lemma 10.2, either Im p > 0.36, or the minimal polynomial of p is X^ + 3X^ — 14X + 11. Since 
p = ±T, one of the alternatives (ii) or (iv) of the present lemma holds. □ 

Theorem 10.4. Let T < SL2(C) be a discrete cocompact group having integral traces, such that 
Hi(T;Zs) = for s = 2,3,7. Suppose that the trace field of T is a cubic field. Then 0.3 is a 
Margulis number for Il(T) . 

Proof. Let K denote the trace field of T. According to Lemma 9.2 we have K i^Tl. 
If X and y are non-commuting elements of T we must show that 
(10.4.1) max{d{P,x - P),d{P,y - P)) > 0.3. 

This is trivial if either Il(x) or H(y) has real translation length greater than 0.3. We therefore 
assume that length(H(x)) and length(H(y)) are both at most 0.3. 

If traces; and trace y are nifty, the first assertion of Theorem 1.6 implies that (10.4.1) holds. Hence 
we may assume that trace x and trace y are not both nifty, and by symmetry we may assume that 
r = trace x is not nifty. 

Thus r does not satisfy Alternative (i) of Lemma 10.3. Since length(n(x)) < 0.3, it follows from 
Proposition 9.1 that | Im(T)| < 2 sinh(0.3/2) = 0.301 . . ., so that r does not satisfy Alternative (ii) 
of Lemma 10.3 either. Hence r must satisfy one of the alternatives (iii) or (iv) of Lemma 10.3. It 
follows that for some p E {r, — r}, either the minimal polynomial of p or p + 1 is X^ + 2X'^ — 3X + 1, 
or the minimal polynomial of 1 — p is + 3X^ — 14X + 11. If 1 — p has minimal polynomial 
X^ + 2X2 -I, then p has minimal polynomial -((1 - Xf + 2(1 - Xf - 3(1 - X) + 1) = 
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— + AX — 1. Hence the minimal polynomial of p is one of the polynomials 

(10.4.2) X^ - bX"^ + 4X - 1, X^ + _ _^ qj. ^3 _^ 3^2 _ -^^^ ^ 

We set pq = /9, and for each r > 1, we set pr = trace(x2'^). In particular and for any r > we have 
trace(x2'^) = i/j^, and hence 

(10.4.3) pr+i = trace((a;^)2) = (trace(x''))2 -2 = pi -2. 
We claim: 

10.4.4. For each r > 0, the element pr is primitive in K. 

This is clear for r = because the three possibilities for the minimal polynomial of po listed in 
(10.4.2) are all cubic. If pr is primitive in K for a given r, then since Pr+i = Pr ~ (10.4.3), 
and since ii' is a degree-3 extension K of Q, the element Pr+i is also primitive in K. This proves 
10.4.4. 

We let fr{X) denote the minimal polynomial of pr, which by 10.4.4 has degree 3. Then /o is one of 
the three polynomials listed in (10.4.2). For each r > we write fr{X) = X^ + brX'^ + CrX + dr- 

Note that if r > 1 is an integer such that \dr\ 7^ 1 and |dr+i| 7^ then the minimal polynomials 
and fr+i of Pr andpr+i both have constant terms 7^ ±1, and hence pr and pr -\- 1 are non-units 
in Ok- Since pr = trace(x2'^) by definition, and since pr+i = — 2 by (10.4.3), this means by 
definition that trace(x2'^) is swell. This shows: 

10.4.5. If r > 1 is an integer such that \dr\ 7^ 1 and \dr+i\ 7^ 1, then trace(a;2'^) is swell. 

For any r > 0, since = fr{pr) = P^ + WPr + CrPr + dr, we have pf + CrPr = —{brPr + dr), and 
hence (p^ + CrPr)'^ = {brpf. + dr)^. This gives 

pf. + (2c, - bl)pj + (c^ - 2brdr)pl -dl = 0. 

Since p^. = pr+i + 2 by (10.4.3), it follows that 

{pr+l + 2f + {2Cr - bl){pr+l + 2f + (c^ - 26,(i,)(Pr+l + 2) - = 0, 

SO that pr+i is a root of the cubic polynomial 

X^ + {2cr -bl + 6)X2 + (c,2 + 8cr - Abl - 2brdr + 12)X + {2c}. + 8cr - Abrdr - Abl -dl + 8), 
which must therefore be its minimal polynomial fr+i{X). This shows that 

6^+1 = 2cr — bf, +6, 
(10.4.6) Cr+i =cl + 8cr - Abl - 2brdr + 12, and 



dr+i = 2cl + SCr - Abrdr - 46^ - + 8. 



We now claim: 



10.4.7. // d{P,x ■ P) < 0.3, then there is an integer k > such that trace(x'^) is swell and 
d{P,x''-P) < 0.401. 
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In proving 10.4.7 I will take Clength(n(x)) (2.5) to be normalized so that its imaginary part lies 
in (— TT, tt]. For any positive integer m, the complex length of n(x"^) (not necessarily normalized) 
is m Clength(n(x)). I will denote by s the distance from s to the axis of n(x), and for each m > 1 
I will set Dm = d{P, Il{x)'^ ■ P). The hypothesis of 10.4.7 implies that Di < 0.3. By the discussion 
in 2.5, we have s = Li;(m Clength(n(x)), D^) for every m > 1. 

The proof of 10.4.7 is divided into three cases corresponding to the possibilities for the polynomial 
/o listed in (10.4.2). First suppose that fo{X) = — 5X^ + 4X — 1. From successive applications 
of (10.4.6) we find that di = —49 and d2 = 4487. Hence by 10.4.5, trace(x^) is swell. On the other 
hand, since p is an imaginary root of — 5X^ + 4X — 1, we have p = (0.4602 . . .) it i(0. 182582 . . . ). 
By (2.5.2) we then find that Clength(n(2;)) = (0.1872 . . .)± (0.8528 . . .)iTr. Hence, computing values 
of uj from the defining formula (2.5.1), and following the conventions described in 2.8, we find that 

s = u;(Clength(n(3;)), L»i) < w(Clength(n(x)), 0.3) = 0.120 . . . , 

and that 

6j(2Clength(n(x)), 0.395) = 0.13... > s = cj(2 Clength(n(x)), L>2). 

Next suppose that fo{X) = X^ + 2X^ — 3X + 1. From successive applications of (10.4.6) we find 
that di = —23 and d2 = 53. Hence by 10.4.5, trace(x^) is swell. On the other hand, since p is an 
imaginary root ofX^ + lX"^ -3X + 1, we have p = (0.539797. . .) ± z(0. 182582 .. .). By (2.5.2) we 
then find that Clength(n(x)) = (0.18927 . . .) ± (0.8268 . . .)iir. Hence 

s = uj{Clength(Jl{x)),D) < a;(Clength(n(x)), 0.3) = 0.1205 . . . , 

and 

w(2Clength(n(x)), 0.401) = 0.121... > s = w(2 Clength(n(x)), 1)2). 
It follows that D2 < 0.401. Thus 10.4.7 holds with A; = 2 in this case. 

The remaining case of 10.4.7 is the one in which fo{X) = X"^ + 3X^ — lAX + 11. In this case, from 
successive applications of (10.4.6) we find that d2 = —2569 and ds = 6578647. Hence by 10.4.5, 
trace(2;^) is swell. On the other hand, since p is an imaginary root of X"^ + 3X^ — lAX + 11, we 
havep = (1.38068... )±i(0.05457...). By (2.5.2) we then find that Clength(n(x)) = (0.0753... )± 
(0.5153 .. .)i7r. Hence 

s = u;(Clength(n(x)), D) < u;(Clength(n(x)), 0.3) = 0.19 . . . , 

and 

a;(4Clength(n(a;)),0.32) = 0.29... > s = a;(4 Clength(n(a;)), D4). 

It follows that < 0.32. Thus 10.4.7 holds with A; = 4 in this case, and is now established in all 
cases. 

We now assume that m.ax{d(P, x- P), d{P, y ■ P)) < 0.3. We will show that this leads to a contradic- 
tion, thereby completing the proof of the theorem. Since in particular d{P, x ■ P) < 0.3, it follows 
from 10.4.7 that we may fix an integer k > such that trace(x'^) is swell and d{P,x^ ■ P) < 0.401. 

It follows from Lemma 6.1 that x^ and yx^y^^ do not commute. In particular, x^ and y do not 
commute. 

Since trace(x^) is swell and y does not commute with x^, it follows from Proposition 6.3 that either 
x*^ and yx^^y~^ are independent elements of F, or x'^^ and yx^^y~^ are independent elements of F. 
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If and yx^^y ^ are independent elements of F, then by Theorem 2.2 we have 

('14 1 ^ ^ < 1/2 

^ ' ' > 1 + exp(d(P, x^ ■ P) 1 +exp((i(P,y2;4fcy-i • P) " ^ 

On the other hand, since d{P, x^ ■ P) < 0.401 and d{p, y ■ P) < 0.3, the left side of (10.4.8) is 
bounded below by 

1 1 

+ 77^ 77^ ^-TTTTV = 0.5004 .... 



1 + exp(0.401) 1 + exp(2 x 0.3 + 4 x 0.401) 
Thus we have the required contradiction in this case. 

If x"^^ and yx'^^y~^ are independent elements of T, then by Theorem 2.2 we have 

1 1 
"^^ l + exp(d(P,x2fc.p)) + l+exp(d(P,yx2V^-^)) " ^ 

On the other hand, since d{P, x^ ■ P) < 0.401 and d{p, y ■ P) < 0.3, the left side of (10.4.9) is 
bounded below by 

1 1 

+ r. 77i — — — - — ——^ = 0.507 .... 



1 + exp(2 X 0.401) 1 + exp(2 x 0.3 + 2 x 0.401) 
Thus we have the required contradiction in this case as well. □ 

10.5. We conclude with the 



Proof of Theorem 1.2. Suppose that M satisfies the hypothesis of Theorem 1.2. Let us write M = 
H^/ro for some torsion-free cocompact discrete subgroup Tq of Isom+(]H[^). According to 2.4, Tq 
is the isomorphic image under 11 of a cocompact (and torsion- free) subgroup T of SL2(C). The 
hypotheses of Theorem 1.2 now immediately imply those of Theorem 10.4. The latter theorem 
therefore implies that 0.3 is a Margulis number for i-'(r) = Tq and hence for M. □ 



11. FiNITENESS OF NON-NIFTY ELEMENTS AND APPLICATIONS 



In this section I will prove Theorems 1.3 and 1.4, which were stated in the introduction. These will 
be established by combining Propositions 6.3 and 6.4 and some of the geometric ideas of Section 
8 with some deep results from number theory. Theorem 11.2 summarizes the number-theoretic 
information that I will need. 

Proposition 11.1. Let L be a number field and let c be an element of O^. Then there are at most 
finitely many ordered pairs (ui, U2) of units in Ol such that 

(11.1.1) Ui + U2 = C. 

Proof. This is easily deduced from [16, Theorem D.8.1], a result due to Siegel and Mahler. We let 
L play the role of K in [16, Theorem D.8.1], and we choose the finite set S of absolute values in 
such a way that 1/c G Rs; this can be done by taking S to consist of all archimedean absolute 
values, together with all the non-archimedean absolute values w such that \c\w > 1- Now if ui and 
U2 are units in Ol satisfying (11.1.1), then vi = ui/c and V2 = U2/C are units in Rs whose sum is 
1. According to [16, Theorem D.8.1], there are only finitely many possibilities for vi and V2, and 
hence for ui = cvi and U2 = CV2. □ 

Theorem 11.2. If K is an arbitrary number field, Ok contains at most finitely many non-nifty 
elements. 
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Proof. If r G Ok is not nifty, in particular it is not swell. By definition this means that either r 
is a unit or — 2 is a unit. Hence it suffices to prove that (a) there are at most finitely many 
non-nifty units in Ok, and (b) there are at most finitely many elements r G Ok such that — 2 
is a unit. 

To prove (a), suppose that r is a non- nifty unit. Since r is non- nifty, the definition implies that 
either r — lorr + lisa unit. If r — 1 is a unit, then ui = t and U2 = 1 — t are units satisfying 
(11.1.1) with c = 1. If r + 1 is a unit, then ui = — r and U2 = 1 + t are units in Ok satisfying 
(11.1.1) with c = 1. Hence (a) follows from the case of Proposition 11.1 in which L = K and c = 1. 

To prove (b), suppose that r is an element of Ok such that — 2 is a unit. Consider the number 
field L = i^[\/2] D K. In Ol we have - 2 = (r - \/2)(r + \/2), and hence r - \/2 and r + \/2 
are units. Hence ui = \f2 — r and U2 = \/2 + r are units satisfying (11.1.1) with c = 2\/2- Hence 
(b) follows from the case of Proposition 11.1 in which L = ir[\/2] and c = □ 

Remark 11.3. Using the results of [8] one can give a bound on the number of non-nifty elements 
of Ok which is exponential in the degree of K. 

The proof of Theorem 1.3 will combine Theorem 11.2 with the following result, the proof of which 
will be extracted from [12]. 

Proposition 11.4. Let M be a closed, orientable hyperbolic 3-manifold, and let ^ be a positive 
real number. Let us write M = H^/Po for some torsion-free cocompact discrete subgroup Pq of 
Isom+(]HI'^), and let q : — > M denote the quotient map. Let C be a closed geodesic in M of length 
I < fi, andletX<TQ denote the stabilizer of q^"^ {C) . Suppose that for every ^Tq — X , for every 
X ^ X — {1} and for every P E H^, we have max((i(P, x • P),d(P, 7x7"^ • P)) > fi. Then there is 
an embedded tube about C having radius R, where R is defined by 

cosh /i — cosh 

(11.4.1) sinh^R-- 



cosh,/^ - 1 



Proof. The group X is cyclic and is generated by an element xq with translation length I. Let 
L = I + iO denote its complex length. It is a lemma due to Zagier [19, p. 1045] that if / and 9 are 
real numbers with < / < 'K^/?> then there exists an integer n > 1 such that 



AttI 

(11.4.2) cosh n/ — cos n0 < cosh W — 1 . 

" V ^ 

Let us set x = Xq, and let R be defined by (1.3.1), and let Z denote the radius- i? neighborhood of 
A. We claim: 

11.4.3. For every P £ Z we have d{P, x ■ P) < fi. 

To prove 11.4.3, let P £ Z be given and set D = dist(P, x • P). Since A is the axis of x, we 
may apply 2.5, with x playing the role of 7 in 2.5, to deduce that the distance from P to A is 
u;(Clengthn(x), Z?). Hence 

R > a;(Clength H(x) , D) = uj{nl + in9, D), 
which in view of (2.5.1) means that 

2 „ cosh D — cosh nl 



(11.4.4) sinh^i?> 



cosh nl — cos n9 
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It follows from (11.4.2) that in the right hand side of (11.4.4), the denominator is at most cosh(i/47ri/-\/3) — 



1, while the numerator is at least coshD — cosh( y 47rZ /\/3) . Hence 

cosh D — cosh . 
sinh^ R > 



coshy^M.i 

which with (1.3.1) implies that D < fi. This proves 11.4.3. 

We are required to prove that there is an embedded tube about C having radius R. This is 
equivalent to showing that for every 'j £ Tq — X we have 7 • Z n Z = 0. Suppose to the contrary 
that J ■ Z n Z contains a point P. Since P £ Z, it follows from 11.4.3 that d{P,x ■ P) < fi. Since 
-P £ Z,it follows from 11.4.3 that d{x-j~^-P, 7"^ -P) < fi. Hence we have d{P, 7x7"^ -P) < /j,. 
On the other hand, the hypothesis implies that m.ax(d(P, x ■ P),d{P, y-P)) > jU. This contradiction 
completes the proof. □ 

11.5. I will now turn to the proof of Theorem 1.3, which was stated in the introduction. 



Proof of Theorem 1.3. According to Theorem 11.2, K contains at most finitely many non-nifty 
elements. Let ri, . . . ,rAr be all the non-nifty elements of C C that do not lie in the interval 
[—2,2] C R C C; here is a non-negative integer. For each integer j with 1 < i < A, let Sj 
denote the set of all complex numbers L with strictly positive real part such that 2 cosh(L/2) = ±rj. 
For each j we have Sj 7^ since Tj ^ [—2, 2] C R; and all elements of Si have the same real part, 
say li > 0. We set e = min(0.3, h, . . . ,In)- (In particular, if A = we have e = 0.3.) 

Let M be any closed hyperbolic 3-manifold which has trace field K, has integral traces, and satisfies 
Hi{M; Tjg) = for s = 2, 3, 7. Let C be a primitive closed geodesic in M having length I < e. Let us 
write M = M'^/Tq for some torsion-free cocompact discrete subgroup Tq of Isom-|_(]HI^). According 
to 2.4, Tq is the isomorphic image under H of a cocompact (and torsion-free) subgroup F of SL2(C). 
Since Fq is torsion- free and cocompact it is purely loxodromic, and hence trace 7 ^ [—2,2] C R for 
any 7GF — {1}. It follows that any non-nifty element of Ok which is the trace of an element of 
F — {1} must be among the Tj. 

For some primitive element xq of F we have C = A/{II{xq)), where A denotes the axis of II{xq). 
Set r = trace xq and L = Clength(H(xo)). According to (2.5.2) we have r = ±2 cosh(L/2). On 
the other hand, we have ReL = length (H(xo)) = / < e. Hence for any j with 1 < j < A we have 
Re L < Ij. If r were non-nifty we would have r = Tj for some j; since 2 cosh(L/2) = itr, this would 
imply that L £ Sj and hence that ReL = Ij, a contradiction. Thus: 

11.5.1. The trace of xq is nifty. 



The conclusion of Theorem 1.3 asserts that there is an embedded tube about C having radius R, 
where R is defined by (11.4.1) with /i = 0.3. According to Proposition 11.4, it suffices to prove that 
if 7 is any element of F — (xq), we have 

(11.5.2) max{d{P,x ■ P),d{P,jxj'^ ■ P)) > (log3)/3 

for every non-trivial element x of (xq). Note that since 7 does not belong to {xq), it does not 
commute with xq. 
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We first prove (11.5.2) in the case where trace is swell. In this case, for each x G (xq) it follows 
from Proposition 6.3 that either (i) x and 7x^7"^ are independent elements of T, or (ii) and 
7x^7"^ are independent elements of T. In subcase (i), Theorem 2.2 gives 

1/(1 + exp(d(P, X • P))) + 1/(1 + exp(d(P, 7x^ • P))) < 1/2. 

If (11.5.2) is false for some x £ (xq), we have d{P,x ■ P) < (log3)/3 and d{P,jx^j^^ ■ P)) < 
4(log3)/3. Hence 

1/2 > 1/(1 + exp((log 3)/3)) + 1/(1 + exp(4(log 3)/3)) = 0.59 . . . , 

a contradiction. In subcase (ii), Theorem 2.2 gives 

1/(1 + exp((i(P,x2.p))) + 1/(1 + exp(d(P, 7x2. P))) < ^^2. 

If (11.5.2) is false for some x G (xo), we have d(P, x^ • P) < 2(log3)/3 and d(P, 7x^7^^ • P)) < 
2(log3)/3. Hence 

1/2 > 1/(1 + exp(2(log 3)/3)) + 1/(1 + exp(2(log 3)/3)) = 0.64 . . . , 

a contradiction. 

It remains to prove (11.5.2) in the case where trace xq is not swell. According to 11.5.1, trace xq is 
nifty. It therefore follows from Proposition 6.4 that for each x G (xq) the elements x^ and 7x^7^^ 
are independent elements in T. If (11.5.2) is false for some x G (xq), we have d{P,x^ ■ P) < log 3 
and (i(P, 7x^7"^ • P)) < log 3. Hence 

1/2 > 1/(1 +exp(log3)) + 1/(1 + exp(2(log3)/3)) = 1/2, 

a contradiction. □ 

The following lemma will be needed for the proof of Theorem 1.4. 

Lemma 11.6. Let T be a cocompact, torsion-free subgroup o/SL2(C) having integral traces, and 
suppose that Hi(T;7jp) = for p = 2,3 and 7. Let K denote the trace field ofV. Suppose that for 
every pair of elements x,y £T, either (x, y) has infinite index in T, or at least one of the elements 
X, y or xy has nifty trace in Ok- Then 0.183 is a Margulis number for n(r) < Isom-|-(]HI^). 

Proof. We must show that if x and y are non-commuting elements of T, then for every point P G 
we have max(d(P, x • P),d{P,y ■ P)) > 0.183. By hypothesis, either (x,y) has infinite index in T, 
or at least one of the elements x, y or xy has nifty trace in Ok- If {x,y) has infinite index in T, 
then (x, y, ) is a two-generator non-cocompact purely loxodromic subgroup of Isom+(]HI^), and is 
therefore free. Since x and y do not commute, (x, y, ) has rank 2, and so x and y are independent. 
By Theorem 2.2, it follows that 1 / {1 + exp{d{P, x - P))) + I / {1 + exp{d{P, y - P))) < 1/2, and hence 
that max((i(P, x • P), d(P, y-P)) > log 3. We may therefore assume that at least one of the elements 
X, y or xy has nifty trace in Ok- By symmetry we may assume that either x or xy has nifty trace 
in Ok- 

We define elements u,v € T as follows. If trace x is nifty we set u = x and v = y. If trace x is 
not nifty (so that trace(xy) is nifty) we set u = xy and v = x~^. In either case, trace u is nifty. 
Furthermore, in either case, since x and y do not commute, u and v do not commute. 

If trace u is swell, it follows from Proposition 6.3 that either u and vu'^v~^ are independent elements 
of r, or and vu^v'^ are independent elements of T. If tracer is nifty but not swell, it follows 
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from Proposition 6.4 that u'^ and vu'^v~^ are independent elements of T. Thus in any case there is 
a pair (X, Y) in the list 

(11.6.1) {x,yxS-^), {x^.yx^y-^), (x^yxV^), [xy^yxf), {{xyf,{yxf), {{xyf,{yxf) 
such that X and Y are independent elements of T. 

For each of the pairs {X^Y) in the list (11.6.1), X and Y are defined as words in x and y. In 
each entry in the list except {xy, (yx)^), the lengths of the words defining X and Y are both 
at most 6. In the case of the entry (xy, (yx)^), the lengths are 2 and 8. Hence if we set n = 
max(d(i-*, x ■ P), d{P, y ■ P)), then in each case we have either 

(11.6.2) max{d{P,X ■ P),d{P,Y ■ P) < 6fi 
or 

(11.6.3) d{P,X-P)<2fi and d{P,Y ■ P) < 8^. 
On the other hand, since X and Y are independent. Theorem 2.2 gives 

1 1 

+ — < 1/2. 



l+exp{d{P,X ■ P)) 1 + exp{d{P,Y ■ P)) 
Hence if (11.6.2) holds we have 

2 

(11.6.4) — - < 1/2, 

^ ^ 1 + exp(6/i) - ' ' 

and if (11.6.3) holds we have 

1 1 

(11.6.5 , r + z r<l/2. 

^ ' 1 + exp(2^) 1 + exp(8/i) " ' 

The left hand sides of (11.6.4) and (11.6.5) are monotonically decreasing in and respectively take 
the values 0.5002 . . . and 0.59 . . . when fj, = 0.183. Hence fi > 0.183, as required. □ 

11.7. I will now turn to the proof of Theorem 1.4, which was stated in the introduction. 



Proof of Theorem 1.4- Suppose that there is an infinite sequence (Mj)j>o of pairwise non-isometric 
closed, orientable hyperbolic 3- manifolds satisfying conditions (1) — (3) of the statement of the 
theorem. For each j write Mj = M^/rj,^'^ for some torsion-free cocompact discrete subgroup Fg'^ of 
Isom+(]HI'^). According to 2.4, Fg'^ is the isomorphic image under H of a cocompact (and torsion- 
free) subgroup F^-') of SL2(C). Then it follows from Lemma 11.6 that for each j there exist elements 
Xj, yj £ T^^^ such that the traces of xj, yj and xjyj are all non-nifty, and the subgroup A'^-'^ = {xj,yj) 
has finite index in F(j). Set Mj = B.^/A^^\ so that Mj is a finite-sheeted covering of Mj. 

Since by Theorem 11.2 there are only finitely many non-nifty elements in Ok, we may assume after 
passing to a subsequence that the sequences (trace Xj)j>o, (trace ?/j)j>o and trace(xjyj))j>o are 
constant. Thus, given any j > 0, we have trace Xj = trace rco, trace = trace yo and trace(xjyj) = 
trace(3;oyo)- It then follows from [24, Proposition 4.4.2] that for every word W in two letters we 
have trace W{xj,yj) = tiaceW{xo,yo). According to [10, Proposition 1.5.2], this implies that for 
some Aj G SL2(C) we have Xj = AxoA~^ and yj = AyoA'^. In particular Aj = AAqA~^, so that 
Mj is isometric to Mq for each j. 

Set D denote the diameter of Mq. Then since each Mj has a finite-sheeted covering isometric to 
Mo, each Mj has diameter at most D. 
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Let V denote the infimum of the volumes of all closed hyperbolic 3-manifolds; we have v > 0, for 
example by [19, Theorem 1]. Since each Mj has volume at least v, diameter at most D, constant 
curvature —1 and dimension 3, it follows from the main theorem of [22] that the Mj represent only 
finitely many diffeomorphism types. By the Mostow rigidity theorem, they represent only finitely 
many isometry types. This is a contradiction. □ 

12. Appendix: Proof of Lemma 10.2 
We begin with a few preliminary lemmas. 

Lemma 12.1. Let K be a degree-3 extension of Q, and let p be an element of Ok- Suppose that 
both /O^ — 2 and p — I are units in Ok- Let f G Q[^] denote the minimal polynomial of p over Q. 
Then there exist r, s G Z satisfying r^ — 2s^ = ±1, such that either 

(i) f{X) =X^ + {s- 2)^2 - (r + s + 2)X + (r + A), or 
(ii) f{X) = X^ + sX'^ -{r + s + 2)X + r. 

Proof. If p G Q, then since p^ — 2 and p — 1 are units in Ok, we have both p — 1 = ±1 and 
p2 — 2 = ±1, which is impossible. Hence p is a primitive element of Q, and / has degree 3. Since 
PGOk we have / G Z[X]. We write f{X) = X^ + bX'^ + cX + d, where b,c,de Z. 

Set F(X) = f{X + 1) G Z[X]. Then F is the minimal polynomial of p — 1 over Q. Since p — 1 is a 
unit in Ok, the constant term of F is ±1. Thus \ + b + c + d = /(I) = -F(O) = ±1. Hence: 

(12.1.1) Either 6 + c + d = or 6 + c + d = 2. 

Next, note that since /(p) = 0, we have p^ + cp = —(bp^ + d), and hence 

= (p3 + cpf - {bp^ + df = p6 + (2c - + (^2 _ 25^)^2 _ 

Thus if we set 

g[X) = X^ + (2c - b^)X'^ + (c^ - 2bd)X - d^ £ Z[X], 

then g{p'^) = 0. Now since p has degree 3 we cannot have p^ G Q. Hence p^ is itself a primitive 
element of Q, and g is therefore the minimal polynomial of p^. If we set G{X) = g{X + 2) G Z[X], 
it follows that G is the minimal polynomial of p^ — 2 over Q. Since p^ — 2 is a unit in Ok, the 
constant term of G is ±1. Thus 

8 + 4(2c - 6^) + 2(c2 - 2bd) - d^ = g{2) = G(0) = ±1. 

Hence: 

(12.1.2) 4(2c - 6^) + 2{c^ - 2bd) - = -8 ± 1. 

According to (12.1.1) we have either c = —{b + d) oi c = —{b + d + 2). Let us first consider the 
case in which c = —{b + d). In this case, we have 

4(2c - b^) + 2{c^ - 2bd) -d^ = -8(6 + d) - 46^ + 2(6 + df - 4bd - d^ 

= (d- 4)2 -2(6 + 2)2 -8. 

Hence according to (12.1.2), we have 

(d- 4)2 -2(6 + 2)2 = ±1. 
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If we set r = d - 4 and s = 5 + 2, it now follows that - 2s^ = ±1 and that f{X) = + bX^ + 
cX + 1 = X^ + {s- 2)X2 - (r + s + 2)X + (r + 4). Thus Alternative (i) of the conclusion of the 
lemma holds in this case. 

We now turn to the case in which c = —(6 + d + 2). In this case, we have 

4(2c - 6^) + 2{c^ - 2hd) -(f = -8(6 + d + 1) - 46^ + 2(6 + d + 2f - 4bd - 



Hence according to (12.1.2), we have 



d^ - 26^ 



-2h^ = ±1. 



If we set r = d and s = 6, it now follows that — 2s^ = ±1 and that f{X) = X^ + 6X^ + cX + d = 
X^ + sX^ — (r + s + 2)X + r. Thus Alternative (ii) of the conclusion of the lemma holds in this 
case. □ 

Lemma 12.2. Let G denote the function defined for x ^ and for all real y by 

4 97 1 8 

(12.2.1) G{x, y) = (1 + yf + -(1 + yf - 4y - -^y^ - -y(l + y). 

Then for every x < and every y with 1/2 < \y\ < 2, we have G{x,y) > {y — 1)^ — 27y^/x^. 
Furthermore, for every x with x > 68 and for every y with 1.31 < \y\ < 2, we have G{x, y) > 0. 

Proof. The definition of G{x, y) may be rewritten as 

(12.2.2) G{x, y) = iy- if - ^ (1 + y){y -h{2-y)- '^y\ 

X 2 x^ 

For -2 <y < -1/2, and for 1/2 < y < 2, it is clear that 

(12.2.3) (l+y)(y- ^)(2-y) >0. 

The first assertion of the lemma follows immediately from (12.2.2) and (12.2.3). 
Now suppose that x > 68 and 1.31 < |y| < 2. In this case it follows from (12.2.2) and (12.2.3) that 

G(x,y) >G(68,y) 

(12.2.4) =l(y3^(31_^^ 2_71 

> H{y)/17, 

where H(y) is the one-variable polynomial function defined by 

//(y) = y3 + (15.4)y2-(35.5)y + 18. 

The critical points of H occur at —11.31 and 1.04 . . .. The former is a local maximum and the 
latter is a local minimum. In particular, H is monotone decreasing on the interval [—2, —1.31] 
and monotone increasing on the interval [1.31,2]. Hence for —2 < y < —1.31 we have H{y) > 
H{-1.31) = 88.6. . ., and for 1.31 < y < 2 we have H{y) > F(1.31) = 0.171 .... In particular we 
have H(y) > whenever 1.31 < |y| < 2. In view of (12.2.4), this establishes the second assertion 
of the lemma. □ 

Lemma 12.3. Let f £ Z[X] be given. Suppose that there exist r,s g7j satisfying r^ — 2s^ = ±1, 
\s\ > 70, and \r\ > 99, such that either 

(i) f{X) = X^ + {s- 2)X^ - (r + s + 2)X + (r + 4), or 
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(ii) f{X) =X^ + sX'^-{r + s + 2)X + r. 
Then the discriminant of f is positive. 

Proof. Since — = ±1, we have \r/s\ = VY±~s~^. Since \s\ > 70, it foUows that 
(12.3.1) 1.41 < J2 - 1/4900 < \r/s\ < J2 + 1/4900 < 1.42. 



Let us write f{X) = X^ + bX'^ + cX + d. If Alternative (i) of the hypothesis holds we have b = s — 2, 
d = r + 4 and c = —(6 + d). If Alternative (ii) holds we have b = s, d = r and c = —(6 + d + 2). 

We set A = d/b. We claim that 

(12.3.2) 1.31 < |A| < 1.6. 

If Alternative (ii) holds then A = r/s, and (12.3.2) follows from (12.3.1). Now suppose that 
Alternative (i) holds. In this case we have 

d r + 4 rf 4\ / 2\"^ 

Since \s\ > 70 and |r| > 99, it follows from (12.3.1) and (12.3.3) that 

1.4lfl + — ^ (1 -—)< lAI < 1.42(^1 - (1 + —), 
V 70y ^ 99^ ' ' V 70y ^ 99^' 

which implies (12.3.2). Thus (12.3.2) holds in all cases. 
Let A denote the discriminant of /. We have 

(12.3.4) A = b^c^ -Ac' -Ab^d-27d^. 

In the case where Alternative (i) of the hypothesis holds, (12.3.4) gives 

A = 62(6 + df + 4(6 + df - Ab^d - 27d^ - 18bd{b + d), 

and hence 

(12.3.5) ^ = G(6,A), 

where G is the function defined by (12.2.1). Since s > 70 by the hypothesis, we have |6| = |s — 2| > 
|s| — 2 > 68 in this case. Furthermore, by (12.3.2) we have 1.31 < |A| < 1.6. Hence Lemma 12.2 
implies that G(6, A) > 0, which by (12.3.5) shows that A > in this case. 

In the case where Alternative (ii) of the hypothesis holds, (12.3.4) gives 

A = 6^(6 + d + 2)2 + 4(6 + d + 2)^ - 4b^d - 27d'^ - 18bd{b + d + 2), 

and hence 

A 4 27 18 2 

(12.3.6) = (1 + A + 2/6)2 ^ n^x + 2/bf - 4A - -^A^ - -^A(l + A + -). 
b^ 6 6^ 6 

Comparing (12.3.6) with the definition of the function G in (12.2.1), we find that 

Since A = d/b = r/s in this case, it follows from (12.3.1) that 
(12.3.8) 1.41 < |A| < 1.42. 
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In particular, since |6| = |s| > 70 in this case, it follows that 

(12.3.9) 1.38 < |A + 2/6| < 1.45. 

Since in addition to (12.3.9) we have \b\ > 70 > 68, it follows from Lemma 12.2 that 

(12.3.10) G(6,A + 2/6) > if 6 > 0, 
and that 

(12.3.11) G{b, A + 2/6) > (A - 1)^ - 27AV&^ if 6 < 0. 

In the subcase where 6 > 0, so that b > 70, we find from (12.3.7) and (12.3.2) that 

A ^.^ ^ 2, 8 ,108 36,, 
^-G(M+^)>^ + (^ + ^)A 

> ^(8 - (108/70^ + 36/702)(1.6)) 
= (7.98... )/6 > 0. 

Since G{b, A + 2/6) > by (12.3.10), it now follows that A > in this subcase. 
In the subcase where 6 < 0, (12.3.7) and (12.3.8) give 

^ X 2, 8 108 36 ,^ ^ 2 . 

^-G(i,.A+-)>---^-^(l + 1.42+-) 

> -0.134. 

But in this subcase, (12.3.11) and (12.3.8) give 

G(6, A + 2/6) > (0.41)2 _ 27(1.4lV702) > 0.15. 
Hence we have A > in this subcase as well. □ 

We are now ready to give the 

Proof of Lemma 10.2. Let / G Q.[X] denote the minimal polynomial of p over Q. According to 
Lemma 12.1, there exist r,s satisfying — 28^ = ±1, such that either (i) f{X) = fr,sfl{X) = 
X3 + (s - 2)^2 - (r + s + 2)X + (r + 4), or (ii) f{X) = fr,s,2{X) = X^ + sX^ - {r + .s + 2)X + r. 

In particular / has degree 3, and hence K = Q(p). Since K H, we have p ^ R. Thus p and p 
are distinct roots of /. Since / has degree 3 it must also have a real root a. 

Let A denote the discriminant of /. By definition we have 

(12.3.12) A = (p - af{p - af{p - pf = -4Im(/j)2 • \p - < 0. 

The equation — 2.s^ = ±1 satisfied by r and s is a special case of Pell's equation. Solving it by 
the method given in [21, Section 7.8], we find that either (r, s) is one of the eighteen pairs 

(12.3.13) (±1,0), (±3, ±2), (±7, ±5), (±17, ±12), (±41, ±29), 

or else we have \r\ > 70 and |s| > 99. In the latter case. Lemma 12.3 asserts that A > 0, which 
contradicts (12.3.12). 

Hence / must be of the form fr,s,o or /r,s,2 where (r, s) is one of the pairs listed in (12.3.13). Of 
these thirty-six possibilities for /, twenty- nine have positive discriminant and are therefore ruled 
out by (12.3.12). (The discriminant may be calculated by the formula (12.3.4), taking a = s — 2, 
b = r + s + 2 and c = r + A when / = /r,s,0) and a = s.,b = r + s + 2 and c = r when / = fr,s,2-) 
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The remaining possibilities for / are /-i,o,o, f -3,2,0, f 7,5,0, f -1,0,2, f -3,2,0, f -3,-2,0, and f -7,-5,2- 
The imaginary roots of / are equal to 



-(1.573. 


..)±i{M8. 


■■) 


if 


/ = 


/-1, 0,0) 


-(.662. 


..)±i(.562. 


■•) 


if 


/ = 


f -3,2,0, 


+(1.380. 


..)±i(.054. 


■■) 


if 


/ = 


f7,5,0, 


-(.662. 


..)±i(.562. 


■■) 


if 


/ = 


f -1,0,2, 


-(1.539. 


..)±i(.368. 


■■) 


if 


/ = 


f -3,2,2, 


+(.303.. 


) ±i(1.435. 


■•) 


if 


/ = 


f -3,-2,2, and 


+(1.784.. 


)±i(1.307. 


■■) 


if 


/ = 


f -7,-5,2- 



Since the imaginary roots of all of these polynomia' 



s other than f7,5,o{X) = + 3^^ - UX + 11 



have imaginary parts of absolute value greater than 0.36, the conclusion of the lemma follows. □ 
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